

MAY 0 4 


SEQUENCE LISTING 




GENERAL INFORMATION: 

RECF'VBB 

(i) APPLICANT: EPELBAUM, SABINE URSULA 

FALCO, SAVERIO CARL (yj^y J ? ?0OO 

MCDEVITT, RAYMOND ERVIN, III 

(ii) TITLE OF INVENTION: CHIMERIC GENES AND METHODS pJIP* 68418116O(V210 ® 

INCREASING THE LYSINE CONTENT OF 
THE SEEDS OF PLANTS 


(iii) NUMBER OF SEQUENCES: 132 


(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: E. I. DU PONT DE NEMOURS AND COMPANY 

(B) STREET: 1007 MARKET STREET 

(C) CITY: WILMINGTON 

(D) STATE: DELAWARE 

(E) COUNTRY: U.S.A. 

(F) ZIP: 19898 


(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: DISKETTE, 3.50 INCH 

(B) COMPUTER: IBM PC COMPATIBLE 

(C) OPERATING SYSTEM: MICROSOFT OFFICE 97 

(D) SOFTWARE: MICROSOFT WINDOWS 95 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 


(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/824,627 

(B) FILING DATE: MARCH 27, 1997 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: CHRISTENBURY, LYNNE M. 

(B) REGISTRATION NUMBER: 30,971 

(C) REFERENCE/DOCKET NUMBER: BB-1037-F 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 302-992-5481 

(B) TELEFAX: 302-892-7949 

(C) TELEX: 835420 
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( 2 ) 


INFORMATION FOR SEQ ID NO:l: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1350 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1350 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 


RECEIVED 

vjjr jg ZCQO 

CENTER 1600/2300 


ATG GCT GAA ATT GTT GTC TCC AAA TTT GGC GGT ACC AGC GTA GCT GAT 48 

Met Ala Glu lie Val Val Ser Lys Phe Gly Gly Thr Ser Val Ala Asp 

15 10 15 

TTT GAC GCC ATG AAC CGC AGC GCT GAT ATT GTG CTT TCT GAT GCC AAC 96 

Phe Asp Ala Met Asn Arg Ser Ala Asp lie Val Leu Ser Asp Ala Asn 

20 25 30 

GTG CGT TTA GTT GTC CTC TCG GCT TCT GCT GGT ATC ACT AAT CTG CTG 144 
Vsl Arg Leu Val Val Leu Ser Ala Ser Ala Gly lie Thr Asn Leu Leu 
35 40 45 

GTC GCT TTA GCT GAA GGA CTG GAA CCT GGC GAG CGA TTC GAA AAA CTC 192 

Val Ala Leu Ala Glu Gly Leu Glu Pro Gly Glu Arg Phe Glu Lys Leu 

50 55 60 

GAC GCT ATC CGC AAC ATC CAG TTT GCC ATT CTG GAA CGT CTG CGT TAC 240 
Asp Ala lie Arg Asn lie Gin Phe Ala lie Leu Glu Arg Leu Arg Tyr 
65 70 75 .80 

CCG AAC GTT ATC CGT GAA GAG ATT GAA CGT CTG CTG GAG AAC ATT ACT 288 
Pro Asn Val lie Arg Glu Glu lie Glu Arg Leu Leu Glu Asn lie Thr 
85 90 95 

GTT CTG GCA GAA GCG GCG GCG CTG GCA ACG TCT CCG GCG CTG ACA GAT 336 
Val Leu Ala Glu Ala Ala Ala Leu Ala Thr Ser Pro Ala Leu Thr Asp 
100 105 110 

GAG CTG GTC AGC CAC GGC GAG CTG ATG TCG ACC CTG CTG TTT GTT GAG 38 4 
Glu Leu Val Ser His Gly Glu Leu Met Ser Thr Leu Leu Phe Val Glu 
115 120 125 

ATC CTG CGC GAA CGC GAT GTT CAG GCA CAG TGG TTT GAT GTA CGT AAA 432 
lie Leu Arg Glu Arg Asp Val Gin Ala Gin Trp Phe Asp Val Arg Lys 
130 135 140 

GTG ATG CGT ACC AAC GAC CGA TTT GGT CGT GCA GAG CCA GAT ATA GCC 480 
Val Met Arg Thr Asn Asp Arg Phe Gly Arg Ala Glu Pro Asp lie Ala 
145 150 155 160 
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GCG 

CTG 

GCG 

GAA 

CTG 

GCC 

GCG 

CTG 

CAG 

CTG 

CTC 

CCA 

CGT 

CTC 

AAT 

GAA 

528 

Ala 

Leu 

Ala 

Glu 

Leu 

Ala 

Ala 

Leu 

Gin 

Leu 

Leu 

Pro 

Arg 

Leu 

Asn 

Glu 






165 





170 





175 



GGC 

TTA 

GTG 

ATC 

ACC 

CAG 

GGA 

TTT 

ATC 

GGT 

AGC 

GAA 

AAT 

AAA 

GGT 

CGT 

576 

Gly 

Leu 

Val 

lie 

Thr 

Gin 

Gly 

Phe 

lie 

Gly 

Ser 

Glu 

Asn 

Lys 

Gly 

Arg 





180 





185 





190 




ACA 

ACG 

ACG 

CTT 

GGC 

CGT 

GGA 

GGC 

AGC 

GAT 

TAT 

ACG 

GCA 

GCC 

TTG 

CTG 

624 

Thr 

Thr 

Thr 

Leu 

Gly 

Arg 

Gly 

Gly 

Ser 

Asp 

Tyr 

Thr 

Ala 

Ala 

Leu 

Leu 




195 





200 





205 





GCG 

GAG 

GCT 

TTA 

CAC 

GCA 

TCT 

CGT 

GTT 

GAT 

ATC 

TGG 

ACC 

GAC 

GTC 

CCG 

672 

Ala 

Glu 

Ala 

Leu 

His 

Ala 

Ser 

Arg 

Val 

Asp 

lie 

Trp 

Thr 

Asp 

Val 

Pro 



210 





215 





220 






GGC 

ATC 

TAC 

ACC 

ACC 

GAT 

CCA 

CGC 

GTA 

GTT 

TCC 

GCA 

GCA 

AAA 

CGC 

ATT 

720 

Gly 

lie 

Tyr 

Thr 

Thr 

Asp 

Pro 

Arg 

Val 

Val 

Ser 

Ala 

Ala 

Lys 

Arg 

lie 


225 





230 





235 





240 


GAT 

GAA 

ATC 

GCG 

TTT 

GCC 

GAA 

GCG 

GCA 

GAG 

ATG 

GCA 

ACT 

TTT 

GGT 

GCA 

768 

Asp 

Glu 

lie 

Ala 

Phe 

Ala 

Glu 

Ala 

Ala 

Glu 

Met 

Ala 

Thr 

Phe 

Gly 

Ala 






245 





250 





255 



AAA 

GTA 

CTG 

CAT 

CCG 

GCA 

ACG 

TTG 

CTA 

CCC 

GCA 

GTA 

CGC 

AGC 

GAT 

ATC 

816 

Lys 

Val 

Leu 

His 

Pro 

Ala 

Thr 

Leu 

Leu 

Pro 

Ala 

Val 

Arg 

Ser 

Asp 

lie 





260 





265 





270 




CCG 

GTC 

TTT 

GTC 

GGC 

TCC 

AGC 

AAA 

GAC 

CCA 

CGC 

GCA 

GGT 

GGT 

ACG 

CTG 

864 

Pro 

Val 

Phe 

Val 

Gly 

Ser 

Ser 

Lys 

Asp 

Pro 

Arg 

Ala 

Gly 

Gly 

Thr 

Leu 




275 





280 





285 





GTG 

TGC 

AAT 

AAA 

ACT 

GAA 

AAT 

CCG 

CCG 

CTG 

TTC 

CGC 

GCT 

CTG 

GCG 

CTT 

912 

Val 

Cys 

Asn 

Lys 

Thr 

Glu 

Asn 

Pro 

Pro 

Leu 

Phe 

Arg 

Ala 

Leu 

Ala 

Leu 



290 





295 





300 






CGT 

CGC 

AAT 

CAG 

ACT 

CTG 

CTC 

ACT 

TTG 

CAC 

AGC 

CTG 

AAT 

ATG 

CTG 

CAT 

960 

Arg 

Arg 

Asn 

Gin 

Thr 

Leu 

Leu 

Thr 

Leu 

His 

Ser 

Leu 

Asn 

Met 

Leu 

His 


305 





310 





315 





320 


TCT 

CGC 

GGT 

TTC 

CTC 

GCG 

GAA 

GTT 

TTC 

GGC 

ATC 

CTC 

GCG 

CGG 

CAT 

AAT 

1008 

Ser 

Arg 

Gly 

Phe 

Leu 

Ala 

Glu 

Val 

Phe 

Gly 

lie 

Leu 

Ala 

Arg 

His 

Asn 






325 





330 





335 



ATT 

TCG 

GTA 

GAC 

TTA 

ATC 

ACC 

ACG 

TCA 

GAA 

GTG 

AGC 

GTG 

GCA 

TTA 

ACC 

1056 

He 

Ser 

Val 

Asp 

Leu 

lie 

Thr 

Thr 

Ser 

Glu 

Val 

Ser 

Val 

Ala 

Leu 

Thr ' 





340 





345 





350 




CTT 

GAT 

ACC 

ACC 

GGT 

TCA 

ACC 

TCC 

ACT 

GGC 

GAT 

ACG 

TTG 

CTG 

ACG 

CAA 

1104 

Leu 

Asp 

Thr 

Thr 

Gly 

Ser 

Thr 

Ser 

Thr 

Gly 

Asp 

Thr 

Leu 

Leu 

Thr 

Gin 




355 





360 





365 





TCT 

CTG 

CTG 

ATG 

GAG 

CTT 

TCC 

GCA 

CTG 

TGT 

CGG 

GTG 

GAG 

GTG 

GAA 

GAA 

1152 

Ser 

Leu 

Leu 

Met 

Glu 

Leu 

Ser 

Ala 

Leu 

Cys 

Arg 

Val 

Glu 

Val 

Glu 

Glu 



370 





375 





380 
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GGT 

CTG 

GCG 

CTG 

GTC 

GCG 

TTG 

ATT 

GGC 

AAT 

GAC 

CTG 

TCA 

AAA 

GCC 

TGC 

1200 

Gly 

Leu 

Ala 

Leu 

Val 

Ala 

Leu 

He 

Gly 

Asn 

Asp 

Leu 

Ser 

Lys 

Ala 

Cys 


385 





390 





395 





400 


GCC 

GTT 

GGC 

AAA 

GAG 

GTA 

TTC 

GGC 

GTA 

CTG 

GAA 

CCG 

TTC 

AAC 

ATT 

CGC 

1248 

Ala 

Val 

Gly 

Lys 

Glu 

Val 

Phe 

Gly 

Val 

Leu 

Glu 

Pro 

Phe 

Asn 

lie 

Arg 






405 





410 





415 



ATG 

ATT 

TGT 

TAT 

GGC 

GCA 

TCC 

AGC 

CAT 

AAC 

CTG 

TGC 

TTC 

CTG 

GTG 

CCC 

1296 

Met 

lie 

Cys 

Tyr 

Gly 

Ala 

Ser 

Ser 

His 

Asn 

Leu 

Cys 

Phe 

Leu 

Val 

Pro 





420 





425 





430 




GGC 

GAA 

GAT 

GCC 

GAG 

CAG 

GTG 

GTG 

CAA 

AAA 

CTG 

CAT 

AGT 

AAT 

TTG 

TTT 

1344 

Gly 

Glu 

Asp 

Ala 

Glu 

Gin 

Val 

Val 

Gin 

Lys 

Leu 

His 

Ser 

Asn 

Leu 

Phe 



435 440 445 


GAG TAA 1350 

Glu * 

450 

(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

GATCCATGGC TGAAATTGTT GTCTCCAAAT TTGGCG 36 

(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

GTACCGCCAA ATTTGGAGAC AACAATTTCA GCCATG 36 

(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 


CCCGGGCCAT GGCTACAGGT TTAACAGCTA AGACCGGAGT AGAGCACT 48 

(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

GATATCGAAT TCTCATTATA GAACTCCAGC TTTTTTC 37 

(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 



(A) 

LENGTH: 

917 base pairs 


(B) 

TYPE: nucleic acid 


(C) 

STRANDEDNESS: single 


(D) 

TOPOLOGY: 

linear 

(ii) 

MOLECULE TYPE: 

DNA (genomic) 

(ix) 

FEATURE: 



(A) 

NAME/KEY: 

CDS 


(B) 

LOCATION: 

3..911 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 


CC ATG GCT ACA GGT TTA ACA GCT AAG ACC GGA GTA GAG CAC TTC GGC 47 

Met Ala Thr Gly Leu Thr Ala Lys Thr Gly Val Glu His Phe Gly 
15 10 15 


ACC 

GTT 

GGA 

GTA 

GCA 

ATG 

GTT 

ACT 

CCA 

TTC 

ACG 

GAA 

TCC 

GGA 

GAC 

ATC 

95 

Thr 

Val 

Gly 

Val 

Ala 

20 

Met 

Val 

Thr 

Pro 

Phe 

25 

Thr 

Glu 

Ser 

Gly 

Asp 

30 

lie 


GAT 

ATC 

GCT 

GCT 

GGC 

CGC 

GAA 

GTC 

GCG 

GCT 

TAT 

TTG 

GTT 

GAT 

AAG 

GGC 

143 

Asp 

lie 

Ala 

Ala 

35 

Gly 

Arg 

Glu 

Val 

Ala 

40 

Ala 

Tyr 

Leu 

Val 

Asp 

45 

Lys 

Gly 


TTG 

GAT 

TCT 

TTG 

GTT 

CTC 

GCG 

GGC 

ACC 

ACT 

GGT 

GAA 

TCC 

CCA 

ACG 

ACA 

191 

Leu 

Asp 

Ser 

50 

Leu 

Val 

Leu 

Ala 

Gly 

55 

Thr 

Thr 

Gly 

Glu 

Ser 

60 

Pro 

Thr 

Thr 


ACC 

GCC 

GCT 

GAA 

AAA 

CTA 

GAA 

CTG 

CTC 

AAG 

GCC 

GTT 

CGT 

GAG 

GAA 

GTT 

239 

Thr 

Ala 

65 

Ala 

Glu 

Lys 

Leu 

Glu 

70 

Leu 

Leu 

Lys 

Ala 

Val 

75 

Arg 

Glu 

Glu 

Val 


GGG 

GAT 

CGG 

GCG 

AAG 

CTC 

ATC 

GCC 

GGT 

GTC 

GGA 

ACC 

AAC 

AAC 

ACG 

CGG 

287 

Gly 

80 

Asp 

Arg 

Ala 

Lys 

Leu 

85 

He 

Ala 

Gly 

Val 

Gly 

90 

Thr 

Asn 

Asn 

Thr 

Arg 

95 
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ACA TCT GTG GAA CTT GCG GAA GCT GCT GCT TCT GCT GGC GCA GAC GGC 335 
Thr Ser Val Glu Leu Ala Glu Ala Ala Ala Ser Ala Gly Ala Asp Gly 
100 105 110 

CTT TTA GTT GTA ACT CCT TAT TAC TCC AAG CCG AGC CAA GAG GGA TTG 383 
Leu Leu Val Val Thr Pro Tyr Tyr Ser Lys Pro Ser Gin Glu Gly Leu 
115 120 125 

CTG GCG CAC TTC GGT GCA ATT GCT GCA GCA ACA GAG GTT CCA ATT TGT 431 
Leu Ala His Phe Gly Ala lie Ala Ala Ala Thr Glu Val Pro lie Cys 
130 135 140 

CTC TAT GAC ATT CCT GGT CGG TCA GGT ATT CCA ATT GAG TCT GAT ACC 479 
Leu Tyr Asp lie Pro Gly Arg Ser Gly lie Pro lie Glu Ser Asp Thr 
145 150 155 

ATG AGA CGC CTG AGT GAA TTA CCT ACG ATT TTG GCG GTC AAG GAC GCC 527 
Met Arg Arg Leu Ser Glu Leu Pro Thr lie Leu Ala Val Lys Asp Ala 
160 165 170 175 

AAG GGT GAC CTC GTT GCA GCC ACG TCA TTG ATC AAA GAA ACG GGA CTT 575 
Lys Gly Asp Leu Val Ala Ala Thr Ser Leu lie Lys Glu Thr Gly Leu 
180 185 190 

GCC TGG TAT TCA GGC GAT GAC CCA CTA AAC CTT GTT TGG CTT GCT TTG 623 
Ala Trp Tyr Ser Gly Asp Asp Pro Leu Asn Leu Val Trp Leu Ala Leu 
195 200 205 

GGC GGA TCA GGT TTC ATT TCC GTA ATT GGA CAT GCA GCC CCC ACA GCA 671 
Gly Gly Ser Gly Phe lie Ser Val He Gly His Ala Ala Pro Thr Ala 
210 215 220 

TTA CGT GAG TTG TAC ACA AGC TTC GAG GAA GGC GAC CTC GTC CGT GCG 719 
Leu Arg Glu Leu Tyr Thr Ser Phe Glu Glu Gly Asp Leu Val Arg Ala 
225 230 235 

CGG GAA ATC AAC GCC AAA CTA TCA CCG CTG GTA GCT GCC CAA GGT CGC 767 
Arg Glu lie Asn Ala Lys Leu Ser Pro Leu Val Ala Ala Gin Gly Arg 
240 245 250 255 

TTG GGT GGA GTC AGC TTG GCA AAA GCT GCT CTG CGT CTG CAG GGC ATC 815 
Leu Gly Gly Val Ser Leu Ala Lys Ala Ala Leu Arg Leu Gin Gly lie 
260 265 270 

AAC GTA GGA GAT CCT CGA CTT CCA ATT ATG GCT CCA AAT GAG CAG GAA 863 
Asn Val Gly Asp Pro Arg Leu Pro lie Met Ala Pro Asn Glu Gin Glu 
275 280 285 

CTT GAG GCT CTC CGA GAA GAC ATG AAA AAA GCT GGA GTT CTA TAA TGAGAATTC 

Leu Glu Ala Leu Arg Glu Asp Met Lys Lys Ala Gly Val Leu * 

290 295 300 

(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

CTTCCCGTGA CCATGGGCCA TC 

(2) INFORMATION FOR SEQ ID NO:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

CATGGCTGGC TTCCCCACGA GGAAGACCAA CAATGACATT ACCTCCATTG CTAGCAACGG 
TGGAAGAGTA CAATG 

(2) INFORMATION FOR SEQ ID NO:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

CATGCATTGT ACTCTTCCAC CGTTGCTAGC AATGGAGGTA ATGTCATTGT TGGTCTTCCT 
CGTGGGGAAG CCAGC 

(2) INFORMATION FOR SEQ ID NO:10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 

CATGGCTTCC TCAATGATCT CCTCCCCAGC TGTTACCACC GTCAACCGTG CCGGTGCCGG 
CATGGTTGCT CCATTCACCG GCCTCAAAAG 



(2) INFORMATION FOR SEQ ID NO:11: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11: 

CATGCTTTTG AGGCCGGTGA ATGGAGCAAC CATGCCGGCA CCGGCACGGT TGACGGTGGT 60 
AACAGCTGGG GAG GAGAT CA TTGAGGAAGC 90 

(2) INFORMATION FOR SEQ ID NO:12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

CCGGTTTGCT GTAATAGGTA CCA 23 

(2) INFORMATION FOR SEQ ID NO:13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

AGCTTGGTAC CTATTACAGC AAACCGGCAT G 31 

(2) INFORMATION FOR SEQ ID NO:14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 

GCTTCCTCAA TGATCTCCTC CCCAGCT 27 
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(2) INFORMATION FOR SEQ ID NO:15: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 
CATTGTACTC TTCCACCGTT GCTAGCAA 


(2) INFORMATION FOR SEQ ID NO:16: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..20 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
70" 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 


CTGACTCGCT GCGCTCGGTC 


(2) INFORMATION FOR SEQ ID NO:17: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..24 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
71" 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 


TATTTTCTCC TTACGCATCT GTGC 



( 2 ) 


INFORMATION FOR SEQ ID NO:18: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



(D) TOPOLOGY: 

linear 

(ii) 

MOLECULE TYPE: 

DNA (genomic) 

(ix) 

FEATURE: 



(A) NAME/KEY: 

misc__feature 


(B) LOCATION: 

1. .27 


(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
78" 

(xij SEQUENCE DESCRIPTION: SEQ ID NO:18: 

TTCATCGATA GGCGACCACA CCCGTCC 27 

(2) INFORMATION FOR SEQ ID NO:19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..27 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
7 9" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19: 

AATATCGATG CCACGATGCG TCCGGCG 27 

(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..55 
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(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
81" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

CATGGAGGAG AAGATGAAGG CGATGGAAGA GAAGATGAAG GCGTGATAGG TACCG 55 
(2) INFORMATION FOR SEQ ID NO:21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..55 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
80" 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 

AATTCGGTAC CTATCACGCC TTCATCTTCT CTTCCATCGC CTTCATCTTC TCCTC 55 
(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..14 

(D) OTHER INFORMATION: /label= name 

/note= "base gene 
[(SSP5)2]" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala 
15 10 

(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1..21 

(D) OTHER INFORMATION: /product= 

"synthetic 
oligonucleotide" 
/standard_name= "SM 
84" 


<xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO:23: 


GATGGAGGAG AAGATGAAGG C 


(2) INFORMATION FOR SEQ ID NO:24: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1..21 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
85" 


(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO:24: 


ATCGCCTTCA TCTTCTCCTC C 


(2) INFORMATION FOR SEQ ID NO:25: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..21 

(D) OTHER INFORMATION: /product- "synthetic 

oligonucleotide" 
/standard_name— "SM 
82" 


21 


21 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 


GATGGAGGAG AAGCTGAAGG C 


(2) INFORMATION FOR SEQ ID NO:26: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..21 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
83" 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 


ATCGCCTTCA GCTTCTCCTC C 


(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 


Met Glu Glu Lys Leu Lys Ala 
1 5 


(2) INFORMATION FOR SEQ ID NO:28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

Met Glu Glu Lys Met Lys Ala 
1 ' 5 


21 


21 
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(2) INFORMATION FOR SEQ ID NO:29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 160 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 
(G) CELL TYPE: DH5 alpha 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: C15 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..151 

(D) OTHER INFORMATION: /function= "synthetic 

storage protein" 

/product= "protein" 

/gene= "ssp" 
/standard_name= 

" 5 . 7 . 7 . 7 . 7 . 7 . 5 " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

C ATG GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG 46 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Leu Lys Ala Met 
15 10 15 

GAG GAG AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG GAG GAG 94 

Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu 

20 25 30 

AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG GAA GAG AAG ATG 142 

Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Met 

35 40 45 

AAG GCG TGATAGGTAC CG 160 

Lys Ala 

50 

(2) INFORMATION FOR SEQ ID NO:30: 

(i) SEQUENCE CHARACTERISTICS: 



(A) 

LENGTH: 

49 amino acids 


(B) 

TYPE: amino acid 


(D) 

TOPOLOGY: 

linear 

(ii) 

MOLECULE TYPE 

: protein 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu 
15 10 15 
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Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys 
20 25 30 

Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Met Lys 
35 40 45 


Ala 

(2) INFORMATION FOR SEQ ID NO:31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 160 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 

(G) CELL TYPE: DH5 alpha 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: C20 


(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..151 

(D) OTHER INFORMATION: /function= "synthetic 

storage protein" 
/product= "protein" 
/gene= "ssp" 

/standard_name= 

"5.7.7.7.7.7.5" 


(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO:31: 


C ATG GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG 46 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Leu Lys Ala Met 
15 10 15 


GAG 

GAG 

AAG 

CTG 

AAG 

GCG 

ATG 

GAG 

GAG 

AAG 

CTG 

AAG 

GCG 

ATG 

GAG 

GAG 

Glu 

Glu 

Lys 

Leu 

Lys 

20 

Ala 

Met 

Glu 

Glu 

Lys 

25 

Leu 

Lys 

Ala 

Met 

Glu 

30 

Glu 

AAG 

CTG 

AAG 

GCG 

ATG 

GAG 

GAG 

AAG 

CTG 

AAG 

GCG 

ATG 

GAA 

GAG 

AAG 

ATG 

Lys 

Leu 

Lys 

Ala 

35 

Met 

Glu 

Glu 

Lys 

Leu 

40 

Lys 

Ala 

Met 

Glu 

Glu 

45 

Lys 

Met 


AAG GCG TGATAGGTAC CG 160 

Lys Ala 

50 
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(2) INFORMATION FOR SEQ ID NO:32: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 


Met 

Glu 

Glu 

Lys 

Met 

Lys 

Ala 

Met 

Glu 

Glu 

Lys 

Leu 

Lys 

Ala 

Met 

Glu 

1 




5 





10 





15 


Glu 

Lys 

Leu 

Lys 

Ala 

Met 

Glu 

Glu 

Lys 

Leu 

Lys 

Ala 

Met 

Glu 

Glu 

Lys 




20 





25 





30 



Leu 

Lys 

Ala 

Met 

Glu 

Glu 

Lys 

Leu 

Lys 

Ala 

Met 

Glu 

Glu 

Lys 

Met 

Lys 


35 40 45 


Ala 

(2) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 139 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 
(G) CELL TYPE: DH5 alpha 

(vii ) IMMEDIATE SOURCE: 

(B) CLONE: C30 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..130 

(D) OTHER INFORMATION: /function= "synthetic 

storage protein" 

/product= "protein" 

/gene= "ssp" 

/standard_name= 

" 5 . 7 . 7 . 7 . 7 . 5 " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

C ATG GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG 4 6 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Leu Lys Ala Met 
15 10 15 

GAG GAG AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG GAG GAG 94 

Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu 
20 25 30 
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139 


AAG CTG AAG GCG ATG GAA GAG AAG ATG AAG GCG TGATAGGTAC CG 
Lys Leu Lys Ala Met Glu Glu Lys Met Lys Ala 
35 40 

(2) INFORMATION FOR SEQ ID NO:34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 


Met 

1 

Glu 

Glu 

Lys 

Met 

5 

Lys 

Ala 

Met 

Glu 

Glu 

10 

Lys 

Leu 

Lys 

Ala 

Met 

15 

Glu 

Glu 

Lys 

Leu 

Lys 

20 

Ala 

Met 

Glu 

Glu 

Lys 

25 

Leu 

Lys 

Ala 

Met 

Glu 

30 

Glu 

Lys 

Leu 

Lys 

Ala 

35 

Met 

Glu 

Glu 

Lys 

Met 

40 

Lys 

Ala 








(2) INFORMATION FOR SEQ ID NO:35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear . 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 

(G) CELL TYPE: DH5 alpha 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: D16 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..88 

(D) OTHER INFORMATION: /function= "synthetic 

storage protein" 
/product= "protein" 
/gene= "ssp" 

/standard_name= 

"5.5.5.5" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 

C ATG GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG ATG AAG GCG ATG 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met 

15 10 15 


46 
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GAG GAG AAG ATG AAG GCG ATG GAA GAG AAG ATG AAG GCG TGATAGGTAC 95 
Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala 
20 25 


CG 


97 


(2) INFORMATION FOR SEQ ID NO:36: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 


Met Glu Glu 
1 


Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met Glu 
5 10 15 


Glu Lys Met 


Lys Ala Met Glu Glu Lys Met 
20 25 


Lys Ala 


(2) INFORMATION FOR SEQ ID NO:37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 
(G) CELL TYPE: DH5 alpha 

(vii ) IMMEDIATE SOURCE: 

(B) CLONE: D20 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..109 

(D) OTHER INFORMATION: /function= "synthetic 

storage protein" 

/product- "protein" 

/gene= "ssp" 
/standard_name= 

"5.5.5.5.5" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 

C ATG GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG ATG AAG GCG ATG 4 6 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met 
15 10 15 
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GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG ATG AAG GCG ATG GAA GAG 94 

Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met Glu Glu 

20 25 30 

AAG ATG AAG GCG TGATAGGTAC CG 118 

Lys Met Lys Ala 

35 

(2) INFORMATION FOR SEQ ID NO:38: 

{ i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met Glu 
15 10 15 

Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys 
20 25 30 

Met Lys Ala 
35 

(2) INFORMATION FOR SEQ ID NO:39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 

(G) CELL TYPE: DH5 alpha 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: D33 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..88 

(D) OTHER INFORMATION: /function^ "synthetic 

storage protein" 

/product= "protein" 

/gene= "ssp" 

/standard_name= 

"5.5.5.5" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 


C ATG GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG ATG AAG GCG ATG 4 6 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met 
15 10 15 

GAG GAG AAG ATG AAG GCG ATG GAA GAG AAG ATG AAG GCG TGATAGGTAC 95 
Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala 
20 25 


CG 


97 


(2) INFORMATION FOR SEQ ID NO: 40: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met Glu 
15 10 15 

Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala 
20 25 


(2) INFORMATION FOR SEQ ID NO:41: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..21 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard__name= "SM 
86 " 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: 


GATGGAGGAG AAGCTGAAGA A 


21 


(2) INFORMATION FOR SEQ ID NO:42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 
<B) LOCATION: 1..21 

(D) OTHER INFORMATION: /product— "synthetic 

oligonucleotide" 
/standard_name= "SM 
87" 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 
ATCTTCTTCA GCTTCTCCTC C 


(2) INFORMATION FOR SEQ ID NO:43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..21 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
88 " 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 
GATGGAGGAG AAGCTGAAGT G 


(2) INFORMATION FOR SEQ ID NO:44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..21 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
89" 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 


ATCCACTTCA GCTTCTCCTC C 


21 


21 
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( 2 ) 


INFORMATION FOR SEQ ID NO:45: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..21 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 

/standard_name= "SM 
90" 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 


GATGGAGGAG AAGATGAAGA A 


(2) INFORMATION FOR SEQ ID NO:46: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..21 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
91" 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 


ATCTTCTTCA TCTTCTCCTC C 


(2) INFORMATION FOR SEQ ID NO:47: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..21 


21 


21 
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(D) OTHER INFORMATION: /product^ "synthetic 

oligonucleotide" 
/standard_name= "SM 
92" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 

GATGGAGGAG AAGATGAAGT G 21 

(2) INFORMATION FOR SEQ ID NO:48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

' (A) NAME/KEY: misc_feature 

(B) LOCATION: 1..21 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
93" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 

ATCCACTTCA TCTTCTCCTC C 21 

(2) INFORMATION FOR SEQ ID NO:49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: 

Met Glu Glu Lys Leu Lys Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO:50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:50: 


Met Glu Glu Lys Leu Lys Trp 
1 5 

(2) INFORMATION FOR SEQ ID NO:51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:51: 

Met Glu Glu Lys Met Lys Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO:52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 

Met Glu Glu Lys Met Lys Trp 
1 5 

(2) INFORMATION FOR SEQ ID NO:53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 160 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 
(G) CELL TYPE: DH5 alpha 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 82-4 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..151 

(D) OTHER INFORMATION: /function= "synthetic 

storage protein 
/product= "protein" 
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46 


/gene= "ssp M 
/standard_name= 

"7.7.7.7.7.7.5" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: 

C ATG GAG GAG AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG 
Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met 
15 10 15 


GAG 

GAG 

AAG 

CTG 

AAG 

GCG 

ATG 

GAG 

GAG 

AAG 

CTG 

AAG 

GCG 

ATG 

GAG 

GAG 

94 

Glu 

Glu 

Lys 

Leu 

Lys 

20 

Ala 

Met 

Glu 

Glu 

Lys 

25 

Leu 

Lys 

Ala 

Met 

Glu 

30 

Glu 


AAG 

CTG 

AAG 

GCG 

ATG 

GAG 

GAG 

AAG 

CTG 

AAG 

GCG 

ATG 

GAA 

GAG 

AAG 

ATG 

142 

Lys 

Leu 

Lys 

Ala 

35 

Met 

Glu 

Glu 

Lys 

Leu 

40 

Lys 

Ala 

Met 

Glu 

Glu 

45 

Lys 

Met 



AAG GCG TGATAGGTAC CG 160 

Lys Ala 

50 

(2) INFORMATION FOR SEQ ID NO:54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:54: 


Met 

1 

Glu 

Glu 

Lys 

Leu 

5 

Lys 

Ala 

Met 

Glu 

Glu 

10 

Lys 

Leu 

Lys 

Ala 

Met 

15 

Glu 

Glu 

Lys 

Leu 

Lys 

20 

Ala 

Met 

Glu 

Glu 

Lys 

25 

Leu 

Lys 

Ala 

Met 

Glu 

30 

Glu 

Lys 

Leu 

Lys 

Ala 

35 

Met 

Glu 

Glu 

Lys 

Leu 

40 

Lys 

Ala 

Met 

Glu 

Glu 

45 

Lys 

Met 

Lys 


Ala 

(2) INFORMATION FOR SEQ ID NO:55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 

(G) CELL TYPE: DH5 alpha 
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(vii) IMMEDIATE SOURCE: 

(B) CLONE: 84-H3 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..88 

(D) OTHER INFORMATION: /function= "synthetic 

storage protein 
/product= "protein" 

/gene= "ssp" 

/standard_name= 

"5.5.5.5" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:55: 

C ATG GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG ATG AAG GCG ATG 46 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met 
15 10 15 

GAG GAG AAG ATG AAG GCG ATG GAA GAG AAG ATG AAG GCG TGATAGGTAC 95 

Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala 
20 25 


CG 


97 


(2) INFORMATION FOR SEQ ID NO:56: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 


(xi) SEQUENCE DESCRIPTION: 


Met Glu Glu 
1 


Lys Met Lys Ala Met Glu 
5 


SEQ ID NO:56: 

Glu Lys Met Lys Ala Met Glu 
10 15 


Glu Lys Met 


Lys Ala Met Glu Glu Lys Met 
20 25 


Lys Ala 


(2) INFORMATION FOR SEQ ID NO:57: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 


(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 
(G) CELL TYPE: DH5 alpha 


(vii) IMMEDIATE SOURCE: 

(B) CLONE: 86-H23 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2.. 88 

(D) OTHER INFORMATION: /function= "synthetic 

storage protein 
/product= "protein" 

/gene= "ssp" 

/standard_name= 

"5.8.8.5" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:57: 

C ATG GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG CTG AAG AAG ATG 46 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Leu Lys Lys Met 
15 10 15 

GAG GAG AAG CTG AAG AAG ATG GAA GAG AAG ATG AAG GCG TGATAGGTAC 95 

Glu Glu Lys Leu Lys Lys Met Glu Glu Lys Met Lys Ala 
20 25 


CG 


97 


(2) INFORMATION FOR SEQ ID NO:58: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:58: 


Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Leu Lys Lys Met Glu 
15 10 15 


Glu Lys Leu Lys Lys Met 
20 


Glu Glu Lys Met 
25 


Lys Ala 


(2) INFORMATION FOR SEQ ID NO:59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 
(G) CELL TYPE: DH5 alpha 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 88-2 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..103 

(D) OTHER INFORMATION: /function= "synthetic 

storage protein 
/product= "protein" 

/gene= "ssp" 

/standard_name= 

" 5 . 9 . 9 . 9 . 5 " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:59: 

C ATG GAG GAG AAG ATG AAG GCG AAG AAG CTG AAG TGG ATG GAG GAG 4 6 

Met Glu Glu Lys Met Lys Ala Lys Lys Leu Lys Trp Met Glu Glu 

15 10 15 

AAG CTG AAG TGG ATG GAG GAG AAG CTG AAG TGG ATG GAA GAG AAG ATG 94 

Lys Leu Lys Trp Met Glu Glu Lys Leu Lys Trp Met Glu Glu Lys Met 

20 25 - 30 

AAG GCG TGATAGGTAC CG 112 

Lys Ala 

(2) INFORMATION FOR SEQ ID NO:60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:60: 

Met Glu Glu Lys Met Lys Ala Lys Lys Leu Lys Trp Met Glu Glu Lys 

1 5 .10 15 

Leu Lys Trp Met Glu Glu Lys Leu Lys Trp Met Glu Glu Lys Met Lys 

20 25 30 


Ala 

(2) INFORMATION FOR SEQ ID NO:61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 
(G) CELL TYPE: DH5 alpha 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 90-H8 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..109 

(D) OTHER INFORMATION: /function= "synthetic 

storage protein 
/product= "protein" 

/gene= "ssp" 

/standard_name= 
"5.10.10.10.5" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:61: 

C ATG GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG ATG AAG AAG ATG 46 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Lys Met 
15 10 15 

GAG GAG AAG ATG AAG AAG ATG GAG GAG AAG ATG AAG AAG ATG GAA GAG 94 

Glu Glu Lys Met Lys Lys Met Glu Glu Lys Met Lys Lys Met Glu Glu 

20 25 30 

AAG ATG AAG GCG TGATAGGTAC CG 118 

Lys Met Lys Ala 
35 

(2) INFORMATION FOR SEQ ID NO:62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:62: 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Lys Met Glu 

1 5 10 15 

Glu Lys Met Lys Lys Met Glu Glu Lys Met Lys Lys Met Glu Glu Lys 

20 25 30 

Met Lys Ala 
35 

(2) INFORMATION FOR SEQ ID NO:63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 
(G) CELL TYPE: DH5 alpha 
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(vii) IMMEDIATE SOURCE: 

(B) CLONE: 92-2 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..88 

(D) OTHER INFORMATION: /function- "synthetic 

storage protein 
/product- "protein" 

/gene- "ssp" 

/standard_name- 

"5.11.11.5" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:63: 

C ATG GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG ATG AAG TGG ATG 4 6 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Trp Met 
15 10 15 

GAG GAG AAG ATG AAG TGG ATG GAA GAG AAG ATG AAG GCG TGATAGGTAC 95 

Glu Glu Lys Met Lys Trp Met Glu Glu Lys Met Lys Ala 

20 25 


CG 


97 


(2) INFORMATION FOR SEQ ID NO:64: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 


(ii) 

(xi) 

Met Glu Glu 
1 


MOLECULE TYPE: protein 

SEQUENCE DESCRIPTION: SEQ ID NO:64: 

Lys Met Lys Ala Met Glu Glu Lys Met Lys Trp Met Glu 
5 10 15 


Glu Lys Met 


Lys Trp Met Glu Glu Lys Met 
20 25 


Lys Ala 


(2) INFORMATION FOR SEQ ID NO:65: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..84 

(D) OTHER INFORMATION: /product- "synthetic 

oligonucleotide" 
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/standard_name= "SM 
96” 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:65: 

GATGGAGGAA AAGATGAAGG CGATGGAGGA GAAAATGAAA GCTATGGAGG AAAAGATGAA 60 
AGCGATGGAG GAGAAAATGA AGGC 84 

(2) INFORMATION FOR SEQ ID NO:66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..84 

.(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
97" 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:66: 

ATCGCCTTCA TTTTCTCCTC CATCGCTTTC ATCTTTTCCT CCATAGCTTT CATTTTCTCC 60 
TCCATCGCCT TCATCTTTTC CTCC 84 

(2) INFORMATION FOR SEQ ID NO:67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..28 

(D) OTHER INFORMATION: /label= name 

/note= "(SSP 5)4" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:67: 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met Glu 
15 10 15 

Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala 
20 25 
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(2) INFORMATION FOR SEQ ID NO:68: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..84 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
98" 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:68: 


GATGGAGGAA AAGCTGAAAG CGATGGAGGA GAAACTCAAG GCTATGGAAG AAAAGCTTAA 60 


AGCGATGGAG GAGAAACTGA AGGC 


84 


(2) INFORMATION FOR SEQ ID NO:69: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..84 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
99" 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:69: 


ATCGCCTTCA GTTTCTCCTC CTACGCTTTA AGCTTTTCTT CCATAGCCTT GAGTTTCTCC 60 


TCCATCGCTT TCAGCTTTTC CTCC 


84 


(2) INFORMATION FOR SEQ ID NO:70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 
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(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..28 

(D) OTHER INFORMATION: /label- name 

/note— "(SSP 7)4" 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:70: 

Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu 
15 10 15 

Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala 
20 25 


(2) INFORMATION FOR SEQ ID NO:71: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..8 4 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
100 " 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:71: 


GATGGAGGAA AAGCTTAAGA AGATGGAAGA AAAGCTGAAA TGGATGGAGG AGAAACTCAA 60 


AAAGATGGAG GAAAAGCTTA AATG 


84 


(2) INFORMATION FOR SEQ ID NO:72: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..8 4 

(D) OTHER INFORMATION: /product- "synthetic 

oligonucleotide" 
/standard_name= "SM 
101 " 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:72: 

ATCCATTTAA GCTTTTCCTC CTACTTTTTG AGTTTCTCCT CCATCCATTT CAGCTTTTCT 60 
TCCATCTTCT TAAGCTTTTC CTCC 84 

(2) INFORMATION FOR SEQ ID NO:73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:73: 

Met Glu Glu Lys Leu Lys Lys Met Glu Glu Lys Leu Lys Trp Met Glu 
15 10 15 

Glu Lys Leu Lys Lys Met Glu Glu Lys Leu Lys Trp 
20 25 

(2) INFORMATION FOR SEQ ID NO:74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 243 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 

(G) CELL TYPE: DH5 alpha 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 2-9 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..235 

(D) OTHER INFORMATION: /function= "synthetic 

storage protein 
/product= "protein" 

/gene= "ssp" 

/standard_name= 

"7.7.7.7.7.7.8.9.8.9.5" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:74: 

C ATG GAG GAG AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG 4 6 

Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met 

1 5 10 15 i 
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94 


GAG GAG AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG GAG GAG 

Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu 

20 25 30 

AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG GAG GAA AAG CTT 142 

Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu 

35 40 45 


AAG 

AAG 

ATG 

GAA 

GAA 

AAG 

CTG 

AAA 

TGG 

ATG 

GAG 

GAG 

AAA 

CTC 

AAA 

AAG 

190 

Lys 

Lys 

Met 

Glu 

Glu 

Lys 

Leu 

Lys 

Trp 

Met 

Glu 

Glu 

Lys 

Leu 

Lys 

Lys 




50 





55 





60 





ATG 

GAG 

GAA 

AAG 

CTT 

AAA 

TGG 

ATG 

GAA 

GAG 

AAG 

ATG 

AAG 

GCG 

TGATAGGTAC 

242 

Met 

Glu 

Glu 

Lys 

Leu 

Lys 

Trp 

Met 

Glu 

Glu 

Lys 

Met 

Lys 

Ala 





65 





70 





75 







C 243 


(2) INFORMATION FOR SEQ ID NO:75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 


<ii) MOLECULE TYPE: protein 



(xi) 

SEQUENCE DESCRIPTION: 

SEQ 

ID NO:75: 




Met 

Glu 

Glu 

Lys 

Leu 

Lys 

Ala 

Met 

Glu 

Glu 

Lys 

Leu 

Lys 

Ala 

Met 

Glu 

1 




5 





10 





15 


Glu 

Lys 

Leu 

Lys 

Ala 

Met 

Glu 

Glu 

Lys 

Leu 

Lys 

Ala 

Met 

Glu 

Glu 

Lys 




20 





25 





30 



Leu 

Lys 

Ala 

Met 

Glu 

Glu 

Lys 

Leu 

Lys 

Ala 

Met 

Glu 

Glu 

Lys 

Leu 

Lys 



35 





40 





45 




Lys 

Met 

Glu 

Glu 

Lys 

Leu 

Lys 

Trp 

Met 

Glu 

Glu 

Lys 

Leu 

Lys 

Lys 

Met 


50 





55 





60 





Glu 

Glu 

Lys 

Leu 

Lys 

Trp 

Met 

Glu 

Glu 

Lys 

Met 

Lys 

Ala 




65 





70 





75 






(2) 

INFORMATION FOR SEQ ID NO: 

76: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 175 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 
(G) CELL TYPE: DH5 alpha 
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{vii) IMMEDIATE SOURCE: 

(B) CLONE: 5-1 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..172 

(D) OTHER INFORMATION: /function= "synthetic 

storage protein 
/product= "protein" 

/gene= "ssp" 

/standard_name= 

" 5 . 5 . 5 . 7 . 7 . 7 . 7 . 5 " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:76: 

C ATG GAG GAG AAG ATG AAG GCG ATG GAG GAG AAG ATG AAG GCG ATG 46 

Met Glu Glu Lys Met Lys Ala Met Glu Glu Lys Met Lys Ala Met 
15 10 15 


GAG 

GAG 

AAG 

ATG 

AAG 

GCG 

ATG 

GAG 

GAA 

AAG 

CTG 

AAA 

GCG 

ATG 

GAG 

GAG 

94 

Glu 

Glu 

Lys 

Met 

Lys 

Ala 

Met 

Glu 

Glu 

Lys 

Leu 

Lys 

Ala 

Met 

Glu 

Glu 






20 





25 





30 



AAA 

CTC 

AAG 

GCT 

ATG 

GAA 

GAA 

AAG 

CTT 

AAA 

GCG 

ATG 

GAG 

GAG 

AAA 

CTG 

142 

Lys 

Leu 

Lys 

Ala 

Met 

Glu 

Glu 

Lys 

Leu 

Lys 

Ala 

Met 

Glu 

Glu 

Lys 

Leu 





35 





40 





45 




AAG 

GCC 

ATG 

GAA 

GAG 

AAG 

ATG 

AAG 

GCG 

TGATAG 






175 

Lys 

Ala 

Met 

Glu 

Glu 

Lys 

Met 

Lys 

Ala 











50 





55 











(2) INFORMATION FOR SEQ ID NO:77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:77: 


Met 

1 

Glu 

Glu 

Lys 

Met 

5 

Lys 

Ala 

Met 

Glu 

Glu 

10 

Lys 

Met 

Lys 

Ala 

Met 

15 

Glu 

Glu 

Lys 

Met 

Lys 

20 

Ala 

Met 

Glu 

Glu 

Lys 

25 

Leu 

Lys 

Ala 

Met 

Glu 

30 

Glu 

Lys 

Leu 

Lys 

Ala 

35 

Met 

Glu 

Glu 

Lys 

Leu 

40 

Lys 

Ala 

Met 

Glu 

Glu 

45 

Lys 

Leu 

Lys 

Ala 

Met 

50 

Glu 

Glu 

Lys 

Met 

Lys 

55 

Ala 
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(2) 


INFORMATION FOR SEQ ID NO:78: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 187 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: E. coli 

(G) CELL TYPE: DH5 alpha 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3..173 

(D) OTHER INFORMATION: /function= "synthetic 

storage protein 
/product^ "protein" 

/gene= "ssp" 
/standard_name= 

"SSP-3-5" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:78: 

CC ATG GAG GAG AAG CTG AAG GCG ATG GAG GAG AAG CTG AAG GCG ATG 47 

Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met 

15 10 15 


GAG 

GAG 

AAG 

CTG 

AAG 

GCG 

ATG 

GAG 

GAG 

AAG 

CTG 

AAG 

GCG 

ATG 

GAG 

GAG 

95 

Glu 

Glu 

Lys 

Leu 

Lys 

Ala 

Met 

Glu 

Glu 

Lys 

Leu 

Lys 

Ala 

Met 

Glu 

Glu 






20 





25 





30 



AAG 

CTG 

AAG 

GCG 

ATG 

GAG 

GAG 

AAG 

CTG 

AAG 

GCG 

ATG 

GAG 

GAA 

AAG 

ATG 

143 

Lys 

Leu 

Lys 

Ala 

Met 

Glu 

Glu 

Lys 

Leu 

Lys 

Ala 

Met 

Glu 

Glu 

Lys 

Met 





35 





40 





45 




AAG 

GCG 

ATG 

GAA 

GAG 

AAG 

ATG 

AAG 

GCG 

TGATAGGTAC CGAATTC 



187 

Lys 

Ala 

Met 

Glu 

Glu 

Lys 

Met 

Lys 

Ala 











50 





55 











(2) INFORMATION FOR SEQ ID NO:79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79:. 

Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu 
15 10 15 

Glu Lys Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys 
20 25 30 
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Leu Lys Ala Met Glu Glu Lys Leu Lys Ala Met Glu Glu Lys Met Lys 
35 40 45 

Ala Met Glu Glu Lys Met Lys Ala 
50 55 


(2) INFORMATION FOR SEQ ID NO:80: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..61 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
107" 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:80: 


CATGGAGGAG AAGATGAAAA AGCTCGAAGA GAAGATGAAG GTCATGAAGT GATAGGTACC 60 


G 


61 


(2) INFORMATION FOR SEQ ID NO:81: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 1..61 

(D) OTHER INFORMATION: /product= "synthetic 

ligonucleotide" 

/standard_name= "SM 
106" 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:81: 


AATTCGGTAC CTATCACTTC ATGACCTTCA TCTTCTCTTC GAGCTTTTTC ATCTTCTCCT 60 
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(2) 


INFORMATION FOR SEQ ID NO:82: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 


(ii) MOLECULE TYPE: protein 


(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..16 

(D) OTHER INFORMATION: /label= name 

/note= "pSK34 base 
gene" 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:82: 


Met Glu Glu Lys Met Lys Lys Leu Glu Glu Lys Met Lys Val Met Lys 
1 5 10 15 


(2) INFORMATION FOR SEQ ID NO:83: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1. . 63 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
110 " 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:83: 


GCTGGAAGAA AAGATGAAGG CTATGGAGGA CAAGATGAAA TGGCTTGAGG AAAAGATGAA 60 


GAA 


63 


(2) INFORMATION FOR SEQ ID NO:84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..63 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
111 " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:84: 

AGCTTCTTCA TCTTTTCCTC AAGCCATTTC ATCTTGTCCT CCATAGCCTT CATCTTTTCT 60 
TCC 63 


(2) INFORMATION FOR SEQ ID NO:85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:85: 


Met 

1 

Glu 

Glu 

Lys 

Met 

5 

Lys 

Lys 

Leu 

Glu 

Glu 

10 

Lys 

Met 

Lys 

Ala 

Met 

15 

Glu 

Asp 

Lys 

Met 

Lys 

20 

Trp 

Leu 

Glu 

Glu 

Lys 

25 

Met 

Lys 

Lys 

Leu 

Glu 

30 

Glu 

Lys 

Met 

Lys 

Val 

35 

Met 

Lys 













(2) INFORMATION FOR SEQ ID NO:86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:86: 


Met 

1 

Glu 

Glu 

Lys 

Met 

5 

Lys 

Lys 

Leu 

Glu 

Glu 

10 

Lys 

Met 

Lys 

Ala 

Met 

15 

Glu 

Asp 

Lys 

Met 

Lys 

20 

Trp 

Leu 

Glu 

Glu 

Lys 

25 

Met 

Lys 

Lys 

Leu 

Glu 

30 

Glu 

Lys 

Met 

Lys 

Val 

35 

Met 

Lys 
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( 2 ) 


INFORMATION FOR SEQ ID NO:87: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..62 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucletide" 
/standard_name= "SM 
112 " 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:87: 

GCTCGAAGAA AGATGAAGGC AATGGAAGAC AAAATGAAGT GGCTTGAGGA GAAAATGAAG 
AA 


60 

62 


(2) INFORMATION FOR SEQ ID NO:88: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..62 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
113" 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:88: 

AGCTTCTTCA TTTTCTCCTC AAGCCACTTC ATTTTGTCTT CCATTGCCTT CATCTTTCTT 60 


CG 


62 


(2) INFORMATION FOR SEQ ID NO:89: 

(i) SEQUENCE CHARACTERISTICS: 



(A) 

LENGTH: 

37 amino acids 


(B) 

TYPE: amino acid 


(D) 

TOPOLOGY: 

linear 

(ii) 

MOLECULE TYPE 

: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:89: 


Met 

1 

Glu 

Glu 

Lys 

Met 

5 

Lys 

Lys 

Leu 

Lys 

Glu 

10 

Glu 

Met 

Ala 

Lys 

Met 

15 

Lys 

Asp 

Glu 

Met 

Trp 

20 

Lys 

Leu 

Lys 

Glu 

Glu 

25 

Met 

Lys 

Lys 

Leu 

Glu 

30 

Glu 

Lys 

Met 

Lys 

Val 

35 

Met 

Lys 













(2) INFORMATION FOR SEQ ID NO:90: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1. . 63 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard__name= "SM 
114" 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:90: 

GCTCAAGGAG GAAATGGCTA AGATGAAAGA CGAAATCTGG AAACTGAAAG AGGAAATGAA 60 


GAA 


63 


(2) INFORMATION FOR SEQ ID NO:91: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1 . . 63 

(D) OTHER INFORMATION: /product= "synthetic 

oligonucleotide" 
/standard_name= "SM 
115" 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:91: 


AGCTTCTTCA TTTCCTCTTT CAGTTTCCAC ATTTCGTCTT TCATCTTAGC CATTTCCTCC 60 


TTG 


63 
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(2) INFORMATION FOR SEQ ID NO:92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:92: 

Met Glu Glu Lys Met Lys Lys Leu Lys Glu Glu Met Ala Lys Met Lys 

15 10 15 

Asp Glu Met Trp Lys Leu Lys Glu Glu Met Lys Lys Leu Glu Glu Lys 

20 25 30 

Met Lys Val Met Glu Glu Lys Met Lys Lys Leu Glu Glu Lys Met Lys 
35 40 45 

Ala Met Glu Asp Lys Met Lys Trp Leu Glu Glu Lys Met Lys Lys Leu 
50 55 60 

Glu Glu Lys Met Lys Val Met Glu Glu Lys Met Lys Lys Leu Glu Glu 

65 70 75 80 

Lys Met Lys Ala Met Glu Asp Lys Met Lys Trp Leu Glu Glu Lys Met 

85 90 95 

Lys Lys Leu Glu Glu Lys Met Lys Val Met Lys 

100 105 

(2) INFORMATION FOR SEQ ID NO:93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 839 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:93: 

GGATCCCCCG GGCTGCAGGA ATTCTACGTA CCATATAGTA AGACTTTGTA TATAAGACGT 60 
CACCTCTTAC GTGCATGGTT ATATGTGACA TGTGCAGTGA CGTTGTACCA TATAGTAAGA 120 
CTTTGTATAT AAGACGTCAC CTCTTACGTG CATGGTTATA TGTGACATGT GCAGTGACGT 180 
TAACCGCACC CTCCTTCCCG TCGTTTCCCA TCTCTTCCTC CTTTAGAGCT ACCACTATAT 240 
AAATCAGGGC TCATTTTCTC GCTCCTCACA GGCTCATCAG CACCCCGGCA GTGCCACCCC 300 
GACTCCCTGC ACCTGCCATG GGTACGCTAG CCCGGGAGAT CTGACAAAGC AGCATTAGTC 360 
CGTTGATCGG TGGAAGACCA CTCGTCAGTG TTGAGTTGAA TGTTTGATCA ATAAAATACG 420 
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GCAATGCTGT AAGGGTTGTT TTTTATGCCA TTGATAATAC ACTGTACTGT TCAGTTGTTG 480 
AACTCTATTT CTTAGCCATG CCAGTGCTTT TCTTATTTTG AATAACATTA CAGCAAAAAG 540 
TTGAAAGACA AAAAAANNNN NCCCCGAACA GAGTGCTTTG GGTCCCAAGC TTCTTTAGAC 600 
TGTGTTCGGC GTTCCCCCTA AATTTCTCCC CTATATCTCA CTCACTTGTC ACATCAGCGT 660 
TCTCTTTCCC CTATATCTCC ACGCTCTACA GCAGTTCCAC CTATATCAAA CCTCTATACC 720 
CCACCACAAC AATAT TATAT ACTTTCATCT TCACCTAACT CATGTACCTT CCAATTTTTT 780 
TCTACTAATA ATTATTTACG TGCACAGAAA CTTAGGCAAG GGAGAGAGAG AGCGGTACC 839 
(2) INFORMATION FOR SEQ ID NO:94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:94: 

CTAGAAGCCT CGGCAACGTC AGCAACGGCG GAAGAATCCG GTG 43 

(2) INFORMATION FOR SEQ ID-NO:95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:95: 

CATGCACCGG ATTCTTCCGC CGTTGCTGAC GTTGCCGAGG CTT 43 

(2) INFORMATION FOR SEQ ID NO:96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:96: 

GATCCCATGG CGCCCCTTAA GTCCACCGCC AGCCTCCCCG TCGCCCGCCG CTCCT 55 
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( 2 ) 


INFORMATION FOR SEQ ID NO:97: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

. (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:97: 

CTAGAGGAGC GGCGGGCGAC GGGGAGGCTG GCGGTGGACT TAAGGGGCGC CATGG 55 

(2) INFORMATION FOR SEQ ID NO:98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:98: 

CATGGCGCCC ACCGTGATGA TGGCCTCGTC GGCCACCGCC GTCGCTCCGT TCCAGGGGC 59 
(2) INFORMATION FOR SEQ ID NO:99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:99: 

TTAAGCCCCT GGAACGGAGC GACGGCGGTG GCCGACGAGG CCATCATCAC GGTGGGCGC 59 
(2) INFORMATION FOR SEQ ID NO:100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:100: 

GCGCCCACCG TGATGA 16 
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(2) INFORMATION FOR SEQ ID NO:101: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:101: 

CACCGGATTC TTCCGC 16 

(2) INFORMATION FOR SEQ ID NO:102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:102: 


GTAAGATTGG 

TAAAGTCCAG 

CAAGAAAATG 

AGATAAAAGA 

GAAGCCTGAA 

AT GACGAAAA 

60 

AATCAGGTGT 

TTTGATTCTT 

GGTGCTGGAC 

GTGTGTNTCG 

CCCAGCTGCT 

GATTTCCTAG 

120 

CTTCAGTTAG 

AACCATTTCG 

TCACAGCAAT 

GGTACAAAAC 

ATATTTCGGA 

GCAGACTCTG 

180 

AAGAGAAAAC 

AGATGTTCAT 

GTGATTGTCG 

CGTCTCTGTA 

TCTTAAGGAT 

GCCAAAGAGA 

240 

CGGTTGAAGG 

TAT T T CAGAT 

GTAGAAGCAG 

TTCGGCTAGA 

TGTATCTGAT 

AGTGAAAGTC 

300 

TCCTTAAGTA 

TGTTTCTCAG 

GTTGATGTTG 

TCCTAAGTTT 

ATTACCTGCA 

AGTTGTCATG 

360 

CTTGTTGTAG 

CA 





372 


(2) INFORMATION FOR SEQ ID NO:103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 323 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:103: 


GGAAGCACAC 

TGCGACTCTT 

TTGGAATTCG 

GGGACATCAA 

GAATGGACAA 

ACAACAACCG 

60 

CTATGGCCAA 

GACTGTTGGG 

ATCCCTGCAG 

CCATTGGAGC 

TCTGCTGTTA 

ATTGAAGACA 

120 

AGATCAAGAC 

AAGAGGAGTC 

TTAAGGCCTC 

TCGAAGCAGA 

GGTGTATTTG 

CCAGCTTTGG 

180 
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ATATATTGCA 

AGCATATGGT 

ATAAAGCTGA 

TGGAGAAGGC 

AGAATGATCA 

AAGAACTCTG 

240 

TATATTGTTT 

CTNTCTATAA 

CTTGGAGTTG 

GAGACAAAGC 

TGAAGGAGNC 

AGNGCCATTA 

300 

GACCAGCAAA 

AAAAGGAGGA 

GGA 
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(2) INFORMATION FOR SEQ ID NO:104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:104: 


Lys 

1 

lie 

Gly 

Lys 

Val 

5 

Gin 

Gin 

Glu 

Asn 

.Glu 

10 

lie 

Lys 

Glu 

Lys 

Pro 

15 

Glu 

Met 

Thr 

Lys 

Lys 

20 

Ser 

Gly 

Val 

Leu 

lie 

25 

Leu 

Gly 

Ala 

Gly 

Arg 

30 

Val 

Xaa 

Arg 

Pro 

Ala 

35 

Ala 

Asp 

Phe 

Leu 

Ala 

40 

Ser 

Val 

Arg 

Thr 

lie 

45 

Ser 

Ser 

Gin 

Gin 

Trp 

50 

Tyr 

Lys 

Thr 

Tyr 

Phe 

55 

Gly 

Ala 

Asp 

Ser 

Glu 

60 

Glu 

Lys 

Thr 

Asp 

Val 

65 

His 

Val 

He 

Val 

Ala 

70 

Ser 

Leu 

Tyr 

Leu 

Lys 

75 

Asp 

Ala 

Lys 

Glu 

Thr 

80 

Val 

Glu 

Gly 

lie 

Ser 

85 

Asp 

Val 

Glu 

Ala 

Val 

90 

Arg 

Leu 

Asp 

Val 

Ser 

95 

Asp 

Ser 

Glu 

Ser 

Leu 

100 

Leu 

Lys 

Tyr 

Val 

Ser 

105 

Gin 

Val 

Asp 

Val 

Val 

110 

Leu 

Ser 

Leu 

Leu 

Pro 

115 

Ala 

Ser 

Cys 

His 

Ala 

120 

Cys 

Cys 

Ser 







(2) INFORMATION FOR SEQ ID NO:105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:105: 

Lys His Thr Ala Thr Leu Leu Glu Phe Gly Asp lie Lys Asn Gly Gin 
15 10 15 
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Thr Thr Thr Ala Met Ala Lys Thr Val Gly lie Pro Ala Ala lie Gly 
20 25 30 

Ala Leu Leu Leu lie Glu Asp Lys lie Lys Thr Arg Gly Val Leu Arg 
35 40 45 

Pro Leu Glu Ala Glu Val Tyr Leu Pro Ala Leu Asp lie Leu Gin Ala 
50 55 60 

Tyr Gly lie Lys Leu Met Glu Lys Ala Glu 
65 70 

(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:106: 

ATTCCCCATG GTTTCGCCGA CGAAT 25 

(2) INFORMATION FOR SEQ ID NO:107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:107: 

CTCTCGGTAC CTAGTACCTA CTGATCAAC 29 

(2) INFORMATION FOR SEQ ID NO:108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:108: 

AGAGAAGCCT GAAATGACGA AAAA 24 
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(2) INFORMATION FOR SEQ ID NO:109: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:109: 

GTCTTGGCCA TAGCGGTTGT TGTT 24 

(2) INFORMATION FOR SEQ ID NO:110: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8160 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 




(ii) MOLECULE TYPE: 

DNA (genomic) 




(xi) SEQUENCE DESCRIPTION: SEQ 

ID NO:110: 



T C TAGAT G CA 

CATTCAACTC 

GAGGTTGTTG 

CATGATGTTT 

CAT T TAC CAA 

AAAAAT CATA 

60 

GTCAAATTAT 

GTAAGCAAAT 

GATATTACAG 

AAAAGTTTTA 

CTAGAGAGTT 

TCAGATTTAC 

120 

ACATGCACAA 

CGTTAAAAAA 

AATAGCAGAA 

AAAAGAAAGA 

AGAAAAGTTC 

TTTATTTGTG 

180 

AGAAAAAT G T 

ATGAAAAAAA 

AAGAGATGGG 

TGTAAAAAGC 

AAAAGGATAG 

GACCACTGTT 

240 

ACTTTGTAGC 

CTCGTTGAGG 

AATCTCTTCT 

CGCATCTCGA 

CTTTTGTGCC 

ATTGCAAAGT 

300 

CAATGCCCAG 

AACTTGTTCC 

CAGGCCATCT 

CCAATTAACT 

ACGTCTATTT 

AATTAAACTT 

360 

TTAAAAGAAA 

ACCTAATAAA 

TTAAACAAAA 

GAAAAGCCGT 

CAACGAAATC 

TAAGCTTGCA 

420 

GCGATATCGA 

TGAACTGATA 

CCAAAACAAT 

GTTCAAGTTT 

CACTTTCAAA 

TTGTTTTTTC 

480 

TTGAAATAGT 

TTATTGGGTA 

AGGCCCATAG 

ATATTTCATA 

AGAAGAACAC 

TTGTCGAGGT 

540 

TGAATCGTAT 

GTCTGCCCAC 

CGCGGCCCAT 

GCATCCTCTG 

TTGGTAGCAT 

AATCGTTTTA 

600 

GGCCATACTA 

TTGTTCGTAC 

ACACTGATTT 

TGAAGTCACC 

TTTGTGCACT 

CCTTAATTCC 

660 

TAAATTGAAG 

AAGCTTGTTC 

TCATTCTTCT 

TTGGGTTACA 

AATGCCAAGG 

CAAAAGGAAC 

720 

TTGGGCCAAA 

TTAAGACAAC 

AACTCAAGCC 

CACTCTCTGC 

AAATAATACT 

TGGGAATTTT 

780 

TACTAAAACG 

GTGCGTTTCA 

TCCAAGAATC 

TATTAATATC 

CCTAACTTGA 

AAT CAT CATA 

840 

TACGTAACCC 

AACATAT TAA 

AGAGTTAATA 

ATGTTAAAAA 

AAGTCTCAGA 

AGAGAGAGAC 

900 

GTAGAGAACA 

CGGAAAGTGG 

TAACTGGTAA 

GCGTCGTCAT 

CGAGGATATA 

GTAGCTACGT 

960 
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GAGCAAACGT 

CTTCACTCAT 

CTCTGTCTAT 

TTCTCTTCGA 

ATACACGTAA 

TACATTTTCG 

1020 

ATTGGATTGA 

TCCTCCCTCG 

GTCCTATCCA 

AGTATCCATC 

CACGTAAACA 

AGAGCTTGTT 

1080 

CCTTTCTTGT 

TTTTTCTTTC 

TTTAAATAGT 

AAAAATACTT 

ATTTCATTTG 

TTTCGTTTGA 

1140 

TTTCATTATT 

ATTGTCTATG 

GCATTATATA 

CTATATATAT 

TATTTCTACA 

ACATTGGCTG 

1200 

GCTCACGTTG 

TTCTCGTGTA 

TACAACAAAC 

TTAATTAATG 

TCTCTCTATT 

GCATTAGATA 

1260 

GTTTCGGAGC 

ATATCCATTA 

TGTGAAAGCC 

ACATTAAGTT 

ATAACTAAAA 

GTAGTTTTCG 

1320 

AAAGAGCTTA 

ATTAAGTTAT 

GTTCTGTTTC 

AAATAAAAAT 

GAACACGAGG 

GATTTTTTTT 

1380 

TT T TTTGACA 

GATCATTATT 

AACAAAAATG 

AT TAC C T GAA 

GAAAGGGGAA 

AATAATTATA 

1440 

GCTGATTACA 

GATCATTATT 

AACAAAAAGA 

ATTCTTGTCA 

CATCATTCAT 

TATAACAAGA 

1500 

AATATTATAT 

TATATTAATT 

TAATCTTTCG 

CTAACACGCC 

CACAATATAT 

TAATCATATA 

1560 

CGTAATTTAG 

CTTATAAAAA 

GGACGGAAAG 

AGATTATTAC 

TGCGCCTAAA 

AAACTCACTA 

1620 

ATTCCAAAGA 

AAAAAAAAAG 

CTTGTATTTT 

TTCTTGACAA 

ACCAGCTCAC 

AGGCATTGCA 

1680 

TGATCAAACT 

CAT C AG G T AC 

GTTTTGATTC 

CTTCTTCCAT 

AATTTTCCCA 

TCTTGAGGAA 

1740 

TGCAAATTTG 

GAGAGCGCTT 

TAGCTAAATC 

ACTGCCTTCA 

TTTTTTCACT 

TTGGATTTAA 

1800 

TAATTTGCAT 

TCCTCTCTTC 

CTCTCTGCTC 

TGTTCTGTTC 

TGTTCTGTTC 

TGATTTGAGT 

1860 

TTTCAATTAA 

TCGCTCGAGC 

AAAAGCTATT 

TCTCAACTCG 

TTAAATTTCT 

GTTCCCAGTT 

1920 

TGTTCGATTT 

T CAACAG T T T 

CACATTAAAG 

TTTGGGTTTT 

TGATGTTTGG 

TTGATGAAAC 

1980 

TCGAAATATG 

AAATGTTTGT 

GAATCTATTC 

CAGGGTGTTT 

AAAATAAGGG 

TTTGTTGTTC 

2040 

AT C T G CAGAG 

AT TATAT G T T 

T T TACAT GAA 

AGATGAATTC 

AAATGGCCAT 

GAGGAGGAGA 

2100 

AGAAGTTGGG 

GAATGGAGTT 

GTGGGGATTC 

TAGCTGAAAC 

AGTTAACAAA 

TGGGAGAGAC 

2160 

GAACAC CAT T 

GACGCCATCG 

CATTGCGCTC 

GCCTTTTACA 

CGGTGGGAAA 

GACAGAAC C G 

2220 

GCATTTCCCG 

CATTGTGGTT 

CAGCCATCTG 

CTAAGCGTAT 

CCATCATGAT 

GCCTTGTATG 

2280 

AAGATGTTGG 

GTGTGAAATT 

TCTGATGATT 

TGTCTGATTG 

TGGGCTTATA 

CTTGGAATCA 

2340 

AACAAC C T GA 

GGTGTGGGAA 

TTTGCATTAA 

AAAGAGTTCC 

TTTTTTTCTT 

CTATATATAT 

2400 

ATCAGTTTAT 

GAGATTTGAT 

TCTGTTTGCA 

GCTAGAAATG 

ATTCTTCCAG 

AGAGAGCATA 

2460 

CGCTTTCTTT 

TCACATACTC 

ATAAGGCACA 

GAAAGAGAAC 

ATGCCTTTGT 

TGGATAAAGT 

2520 

ATTACACTTT 

TCATTTATCC 

TTTTAGTCCT 

AT C TAAGATA 

CTGAGGAATG 

TTGACAAAAG 

2580 

GGGTATCCAA 

TTGCAGATTC 

TTTCTGAGAG 

AGTGACTTTG 

TGTGATTATG 

AGCTCATTGT 

2640 

TGGGGATCAT 

GGGAAACGAT 

TATTGGCGTT 

TGGTAAATAT 

GCAGGCAGAG 

CTGGTCTTGT 

2700 
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TGACTTCTTA 

CACGGACTTG 

GACAGCGTAA 

GCTCATGTTA 

TAATTCTGAT 

GATCAGGACA 

2760 

TGTTTCTGTG 

CAGAACAAGA 

TGAGATGTAA 

TTTTCCATGT 

TTGATGCAGG 

ATATCTAAGT 

2820 

CTAGGATACT 

CAACACCTTT 

CCTCTCGCTC 

GGTGCATCGT 

ATATGTATTC 

CTCATTGGCT 

2880 

GCTGCAAAAG 

CCGCTGTAAT 

TTCTGTTGGT 

GAAGAAATTG 

CAAGCCAGGG 

ACTGCCATTA 

2940 

GGAATCTGCC 

CTCTTGTATT 

TGTCTTCACC 

GGAACAGGAA 

ATGGTATCTT 

CTTTAGTTCT 

3000 

ACTGCGAGTT 

CTTTGAATCC 

TTCTGCATAT 

GTTTCATCTC 

ATTAAAAAAT 

TTCTCATCCG 

3060 

CAGTTTCTCT 

GGGGGCGCAA 

GAAATTTTCA 

AGCTTCTTCC 

TCACACTTTT 

GTTGAACCAA 

3120 

GCAAACTTCC 

TGAACTATTT 

GTAAAAGTAA 

GTCACGCTTT 

GCTTTTTATT 

TGGTTTCAGA 

3180 

GTTTTGAAGA 

TTCTGAAATG 

TATATTTCTC 

ACAG GACAAA 

GGAATTAGTC 

AAAATGGGAT 

3240 

TTCAACAAAG 

CGAGTCTATC 

AAGTATATGG 

TTGTATTATT 

ACCAGCCAAG 

ACATGGTTGA 

3300 

ACACAAAGAT 

C CAT CAAAG T 

CATTCGACAA 

AGTAACACTT 

ACCTTCTTAG 

CTCCTTGGCT 

3360 

GTGACTTTTG 

TTCCACTACG 

C T AAAG TAG A 

ATAC C TAT TA 

ATTCTTCAAG 

CTTATGATGT 

3420 

TTAGGCCGAC 

TATTATGCAC 

ACCCGGAACA 

TTACAATCCA 

GTTTTCCACG 

AAAAGATAT C 

3480 

GCCATATACG 

TCTGTTCTTG 

GTAGATCCTG 

ATCACTGTTT 

TACCTTTAAA 

GCTCAAGAGT 

3540 

T T AC AT AT AA 

GCAAATCCTC 

TGTCCACTCC 

GTGACTGTGA 

CCATCTCATT 

TTGGTTAGTT 

3600 

CCAGTGTGTA 

ACCCCTATGA 

CTTTCTGTGC 

AGTAAACTGT 

ATGTACTGGG 

AGAAGAGG T T 

3660 

TCCCTGTCTT 

CTGAGCACAA 

AACAGCTTCA 

AGATTTAACA 

AAAAAAGGAC 

TCCCACTAGT 

3720 

AGGCATATGT 

GATATAACTT 

GTGACATCGG 

TGGCTCCATT 

GAATTTGTTA 

ACCGAGCTAC 

3780 

TTTAATCGAT 

TCCCCTTTCT 

TCAGGTAATA 

TATACTTAGG 

AAGAGCTTTC 

TTTTGAGTCA 

3840 

TCTACGTTTA 

C TAT GAT GAA 

ACTCGTCGAG 

CTAAACACTA 

TCTCTAGGTT 

TAATCCCTCG 

3900 

AACAATTCAT 

ACTACGATGA 

CATGGATGGG 

GATGGCGTAC 

TATGCATGGC 

TGTTGACATT 

3960 

TTACCCACAG 

AATTTGCAAA 

AGAGGTATGT 

ATGAAGGTTA 

CAGTTATAGT 

ACTTAAGATT 

4020 

AAATCTAAAG 

TTAAAAACCT 

TGTATTGAGT 

GGGAGTTCTT 

GTGTCCTGAA 

AAAGGCATCC 

4080 

CAGCATTTTG 

GAGATATTCT 

TTCCGGATTT 

GTCGGTAGTT 

TGGCTTCAAT 

GACTGAAATT 

4140 

TCAGATCTAC 

CAGCACATCT 

GAAGAGGGCT 

TGCATAAGCT 

ATAGGGGAGA 

ATTGACATCT 

4200 

TTGTATGAGT 

ATATTCCACG 

TATGAGGAAG 

TCAAATCCAG 

AGTATGTTCT 

GCTTCGAGCG 

4260 

TTACTTCATC 

TGAAATATTT 

AGGCCTCTTC 

TCTAAACTAT 

GTTTTCATCT 

TTACCCACTT 

4320 

TAACTGCAGA 

GAGGCACAAG 

ATAATATTAT 

CGCCAACGGG 

GTTTCCAGCC 

AGAGAACATT 

4380 

CAACATATTG 

GTTAGTTTTG 

ATGAAGAAAG 

TATATATAAC 

TAGTTTCCGA 

ATCATATGAT 

4440 
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TTAAGCTAAT 

GAATTAAGAA 

AATATATAGT 

TCAAGACTTA 

TGATTCATAT 

CTCTATCAAC 

4500 

TTTTTGACCA 

AAGAT T GATA 

CTTTTTCGAC 

ATCTGTCACA 

GCATTTTGTG 

ATGATTTTGA 

4560 

T TGAGACAAA 

TCATTTGTAG 

GTATCTCTGA 

GCGGACACCT 

AT T TG AT AAG 

TTTCTGATAA 

4620 

ACGAAGCTCT 

TGATATGATC 

GAAGCGGCTG 

GTGGCTCATT 

TCATTTGGCT 

AAATGTGAAC 

4680 

TGGGGCAGAG 

CGCTGATGCT 

GAATCGTACT 

CAGAACTTGA 

AGTAAGTTTC 

TTTCTGGATA 

4740 

AAACCTAATC 

ATTCACATGG 

AACAACTGTC 

AAGAGTTTTT 

AATGTCACGT 

TTAGGTTCAA 

4800 

TGTCCTTTTC 

ACTAAGTCTC 

GTAAGTTTTT 

AAAACAAG TA 

AACAAACTAC 

AAGCCAAAAA 

4860 

CATTCTGGCC 

C CACAT TAAC 

CTATTCCCAC 

TTGTTAAAGA 

ACCCATCTTG 

CATTATCTTG 

4920 

GTAGGTTGGT 

GCGGATGATA 

AGAGAGTATT 

GGATCAAATC 

ATTGATTCAT 

TAACTCGGTT 

4980 

AGCTAATCCA 

AATGAAGATT 

ATATATCCCC 

ACATAGAGAA 

G CAAATAAGA 

TCTCACTGAA 

5040 

GATTGGTAAA 

GTCCAGCAAG 

AAAAT GAGAT 

AAAAGAGAAG 

CCTGAAATGA 

CGAAAAAATC 

5100 

AGGCGTTTTG 

ATTCTTGGTG 

CTGGACGTGT 

GTGTCGCCCA 

GCTGCTGATT 

TCCTAGCTTC 

5160 

AGTTAGAACC 

ATTTCGTCAC 

AGCAATGGTA 

CAAAACATAT 

TTCGGAGCAG 

AC T C T GAAGA 

5220 

GAAAACAGAT 

GTTCATGTGA 

TTGTCGCGTC 

TCTGTATCTT 

AAGGATGCCA 

AAGAGGTAGG 

5280 

AGAAGCCTTT 

GGGCTTCATC 

TGAGTAATTC 

AGTGTATACG 

ATGAACTATC 

AATCTTTTAA 

5340 

AGTTTTACTG 

AT GAT CAAAT 

TTTCCGCAGA 

CGGTTGAAGG 

TATTTCAGAT 

GTAGAAGCAG 

5400 

TTCGGCTAGA 

TGTATCTGAT 

AGTGAAAGTC 

TCCTTAAGTA 

tgtttctcag 

GTATTTTCCT 

5460 

AACTTCTCTG 

TTCTTAGATC 

ACCTTTACTT 

CAAACTCCAC 

TGTTCAAATC 

CATGATCTTA 

5520 

TATTTTTTTT 

TCATTGCACG 

CAGGTTGATG 

TTGTCCTAAG 

TTTATTACCT 

GCAAGTTGTC 

5580 

ATGCTGTTGT 

AGCAAAGACA 

TGCATTGAGG 

TAAATTCCTA 

ACGTTTAATG 

CGTTTTCCGA 

5640 

GTGAAGTTAT 

GAAATTTGCA 

AATGTTATTC 

GACATAGAGG 

TTAAACTTCC 

TCTGCATAAC 

5700 

ACATTCTTTC 

AGTAGTTTCC 

GGTTCCTAAA 

TGTCTCTGTT 

TCTTCTTTCT 

GATTCACTCA 

5760 

GCTGAAGAAG 

CATCTCGTCA 

CTGCTAGCTA 

TGTTGATGAT 

GAAACGTCCA 

T G T TACAT GA 

5820 

GAAGGCTAAG 

AGTGCTGGGA 

TAACGATTCT 

AGGCGAAATG 

GGACTGGACC 

CTGGAATCGG 

5880 

TATGATATCT 

CACAACATAG 

TATCTCTTAA 

GATCATTTGT 

TCACTTGATT 

TAACTTAAGT 

5940 

GCATTTATCT 

TCAAAATATT 

TCCCGGATAA 

CTGAGAAGGT 

GATCCTACAA 

TGAATCTTTC 

6000 

AGATCACATG 

ATGGCGATGA 

AAATGATCAA 

CGATGCTCAT 

ATCAAAAAAG 

GGAAAGTGAA 

6060 

gtcttttacc 

TCTTATTGTG 

GAGGGCTTCC 

CTCTCCTGCT 

GCAGCAAATA 

ATCCATTAGC 

6120 

ATATAAAT T T 

AGGTACGGTA 

GTCCTTTACG 

CCATTAACAT 

ATTTTGTTTT 

GTTTAACTCA 

6180 
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TTTAGACATC CTTTCAGAAT TTCGCTTACT CAATTACATC TCGGTATTTT CAGCTGGAAC 6240 
CCTGCTGGAG CAATTCGAGC TGGTCAAAAC CCCGCCAAAT ACAAAAGCAA CGGCGACATA 6300 
ATACATGTTG ATGGTATGAA AAACAAAATA TGTCTACATG CAGGAGAGGT TGGAGTAGTT 6360 
TAGCTTCACT ACACATCATT TTTGTTTAAC CGAGCAATGT AAATCGCAGG GAAGAATCTC 6420 
TATGATTCCG CGGCAAGATT CCGAGTACCT AATCTTCCAG CTTTTGCATT GGAGTGTCTT 6480 
CCAAATCGTG ACTCCTTGGT TTACGGGGAA CATTATGGCA TCGAGAGCGA AGCAACAACG 6540 
ATATTTCGTG GAACACTCAG ATATGAAGGC ATGAATTCCA TAATCACAAC TCACGACTCA 6600 
CTTCTCCATA TCTGAAGGCT TAACACTTGT TTTCTTTTGG CTTGTACAGG GTTTAGTATG 6660 
ATAATGGCAA CACTTTCGAA ACTTGGATTC TTTGACAGTG AAGCAAATCA AGTACTCTCC 6720 
ACTGGAAAGA GGATTACGTT TGGTGCTCTT TTAAGTAACA TTCTAAATAA GGATGCCGAC 6780 
AATGAATCAG AGCCCCTAGC GGGAGAAGAA GAGATAAGCA AGAGAATTAT CAAGCTTGGA 6840 
CATTCCAAGG AGACTGCAGC CAAAGCTGCC AAAACAATTG TGTAAGCTTC TCCATGAAGA 6900 
TATATAATCT GAATGTTGCA GTGTGATTCC AATTCTTCTA CGAAACTCCT AACCCCAATT 6960 
CTTTTGTGGT GTCTTAGATT CTTGGGGTTC AACGAAGAGA GGGAGGTTCC ATCACTGTGT 7020 
AAAAGCGTAT TTGATGCAAC TTGTTACCTA ATGGAAGAGA AACTAGCTTA TTCCGGAAAT 7080 
GAACAGGTCT CTGTTTCATG TGAAAGCATT AGTTTTCTTC TCTCACTTGT ATTTGGTGTT 7140 
ACTTACTGAC ATAAACTTTG GACAATCTTT TGCATTATGT TTTCAGGACA TGGTGCTTTT 7200 
GCATCACGAA GTAGAAGTGG AATTCCTTGA AAGCAAACGT ATAGAGAAGC ACACTGCGAC 7260 
TCTTTTGGAA TTCGGGGACA TCAAGAATGG GCAAACAACA ACCGCTATGG CCAAGACTGT 7320 
TGGGATCCCT GCAGCCATTG GAGCTCTGGT CCTTACTAAG ACTTTGATCA CCACTTTTTC 7380 
CTGTCTATAT TTCTCTAAAA TGAAAGTTTT AAGCGTTTGT TTTATGATGT TGTGTGTTGC 7440 
AGCTGTTAAT TGAAGACAAG ATCAAGACAA GAGGAGTCTT AAGGCCTTTC GAAGCAGAGG 7500 
TGTATTTGCC AGGTAAATTA GAATTCCGCT TCAAAAGGAT GTGTGTTGCA GATAAAGACA 75 60 
ATGATGTTGA TTTGTTGTGT GTTTGGGATA TGTGGTGTTA TACATACAGC TTTGGATATA 7620 
TTGCAAGCAT ATGGTATAAA GCTGATGGAG AAGGCAGAAT GATCAAAGAA CTCTGTATAT 7680 
TGTTTCTCTC TATAACTTGG AGTTGGAGAC AAAGCTGAAG AAGACAGAGA CATTAGACCA 7740 
GCAAAAAAAG AAGAAGAAGG AAGAAGATAA GCCTCGATCC TTGGGTGACG AGTATCTATA 7800 
TGTTTATATG TACTATATGT TATGTTGTAC AGAAGAAGTC GTGTCCACAA ATATCAATTG 7860 
ATGTCAGATG TCTAGTAAGT GATCATGTGT AGCATACAAA CTGGAGTAAT TTAAAAAGTG 7920 
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AATAAACAAA AATAATTACT AAACGTTATT CCAAGTAGCT TTCCAAGACA GTCACTTGCC 7980 
CTTTTCCAAT TTCCCTTGCA ATTAACTAAA TTGCTCTTCA C GAT AT GAT A T TAT ACC AAA 804 0 
ATGGTGATAC CTTGGGAATT GTTAATTTGA CTCATTTGAA CAAATCTCAT CTATAAAATC 8100 
ATCCCACCTC TCCACCACAT TTGTTCTCAC TACCAATCAA AAAATAATCT AGTCTTAAAC 8160 
(2) INFORMATION FOR SEQ ID NO:111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3195 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:111: 

ATGAATTCAA ATGGCCATGA GGAGGAGAAG AAGTTGGGGA ATGGAGTTGT GGGGATTCTA 60 
TCTGAAACAG TTAACAAATG GGAGAGACGA ACACCATTGA CGCCATCGCA TTGCGCTCGC 120 
CTTTTACACG GTGGGAAAGA CAGAACCGGC ATTTCCCGCA TTGTGGTTCA GCCATCTGCT 180 

AAGCGTATCC ATCATGATGC CTTGTATGAA CATGTTGGGT GTGAAATTTC TGATGATTTG 240 

TCTGATTGTG GGCTTATACT TGGAATCAAA CAACCTGAGC TAGAAATGAT TCTTCCAGAG 300 

AGAGCATACG CTTTCTTTTC ACAT AC T CAT AAGGCACAGA AAGAGAACAT GCCTTTGTTG 360 

GATAAAATTC TTTCTGAGAG AGTGACTTTG TGTGATTATG AGCTCATTGT TGGGGATCAT 420 
GGGAAACGAT TATTGGCGTT TGGTAAATAT GCAGGCAGAG CTGGTCTTGT TGACTTCTTA 480 
CACGGACTTG GACAGCGATA TCTAAGTCTA GGATACTCAA CACCTTTCCT CTCGCTCGGT 540 
GCATCGTATA TGTATTCCTC ATTGGCTGCT GCAAAAGCCG CTGTAATTTC TGTTGGTGAA 600 
GAAATTGCAA GCCAGGGACT GCCATTAGGA ATCTGCCCTC TTGTATTTGT CTTCACCGGA 660 
ACAGGAAATG TTTCTCTGGG GGCGCAAGAA ATTTTCAAGC TTCTTCCTCA CACTTTTGTT 720 
GAACCAAGCA AACTTCCTGA ACTATTTGTA AAAGACAAAG GAATTAGTCA AAATGGGATT 7 80 
TCAACAAAGC GAGTCTATCA AGTATATGGT TGTATTATTA CCAGCCAAGA CATGGTTGAA 840 
CACAAAGATC CATCAAAGTC ATTCGACAAA GCCGACTATT ATGCACACCC GGAACATTAC 900 
AATCCAGTTT TCCACGAAAA GATATCGCCA TATACGTCTG TTCTTGTAAA CTGTATGTAC 960 
TGGGAGAAGA GGTTTCCCTG TCTTCTGAGC ACAAAACAGC TTCAAGATTT AACAAAAAAA 1020 
GGACTCCCAC TAGTAGGCAT ATGTGATATA ACTTGTGACA TCGGTGGCTC CATTGAATTT 1080 
GTTAACCGAG CTACTTTAAT CGATTCCCCT TTCTTCAGGT TTAATCCCTC GAACAATTCA 1140 
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TACTACGATG 

ACATGGATGG 

GGATGGCGTA 

CTATGCATGG 

CTGTTGACAT 

TTTACCCACA 

1200 

GAATTTGCAA 

AAGAGGCATC 

CCAGCATTTT 

GGAGATATTC 

TTTCCGGATT 

TGTCGGTAGT 

1260 

TTGGCTTCAA 

TGACTGAAAT 

TTCAGATCTA 

CCAGCACATC 

TGAAGAGGGC 

TTGCATAAGC 

1320 

TATAGGGGAG 

AATTGACATC 

TTTGTATGAG 

TATATTCCAC 

GTATGAGGAA 

G T CAAATCCA 

1380 

GAAGAGGCAC 

AAGATAATAT 

TATCGCCAAC 

GGGGTTTCCA 

G CCAGAGAAC 

ATTCAACATA 

1440 

TTGGTATCTC 

TGAGCGGACA 

CCTATTTGAT 

AAGTTTCTGA 

TAAACGAAGC 

TCTTGATATG 

1500 

ATCGAAGCGG 

CTGGTGGCTC 

ATTTCATTTG 

GCTAAATGTG 

AACTGGGGCA 

GAGCGCTGAT 

1560 

GCTGAATCGT 

AC T CAGAAC T 

TGAAGTTGGT 

GCGGATGATA 

AGAGAGTATT 

GGATCAAATC 

1620 

ATTGATTCAT 

TAACTCGGTT 

AGCTAATCCA 

AATGAAGATT 

ATATATCCCC 

ACATAGAGAA 

1680 

GCAAATAAGA 

TCTCACTGAA 

GATTGGTAAA 

GTCCAGCAAG 

AAAAT GAGAT 

AAAAGAGAAG 

1740 

CCTGAAATGA 

CGAAAAAATC 

AGGTGTTTTG 

ATTCTTGGTG 

CTGGACGTGT 

GTGTCGCCCA 

1800 

GCTGCTGATT 

TCCTAGCTTC 

AG T TAGAAC C 

ATTTCGTCAC 

AGCAATGGTA 

CAAAACATAT 

1860 

TTCGGAGCAG 

ACTCTGAAGA 

GAAAACAGAT 

GTTCATGTGA 

TTGTCGCGTC 

TCTGTATCTT 

1920 

AAGGATGCCA 

AAGAGACGGT 

TGAAGGTATT 

TCAGATGTAG 

AAGCAGTTCG 

GCTAGATGTA 

1980 

TCTGATAGTG 

AAAGTCTCCT 

TAAGTATGTT 

TCTCAGGTTG 

ATGTTGTCCT 

AAGTTTATTA 

2040 

CCTGCAAGTT 

GTCATGCTGT 

TGTAGCAAAG 

ACATGCATTG 

AGCTGAAGAA 

GCATCTCGTC 

2100 

ACTGCTAGCT 

ATGTTGATGA 

TGAAACGTCC 

ATGTTACATG 

AGAAGGCTAA 

GAGTGCTGGG 

2160 

ATAACGATTC 

TAGGCGAAAT 

GGGACTGGAC 

CCTGGAATCG 

ATCACATGAT 

GGCGATGAAA 

2220 

AT GAT CAAC G 

ATGCTCATAT 

CAAAAAAGGG 

AAAGTGAAGT 

CTTTTACCTC 

TTATTGTGGA 

2280 

GGGCTTCCCT 

CTCCTGCTGC 

AGCAAATAAT 

CCATTAGCAT 

ATAAATTTAG 

CTGGAACCCT 

2340 

GCTGGAGCAA 

TTCGAGCTGG 

TCAAAACCCC 

GCCAAATACA 

AAAGCAACGG 

CGACATAATA 

2400 

CATGTTGATG 

GGAAGAATCT 

CTATGATTCC 

GCGGCAAGAT 

TCCGAGTACC 

TAATCTTCCA 

2460 

GCTTTTGCAT 

TGGAGTGTTT 

TCCAAATCGT 

GACTCCTTGG 

TTTACGGGGA 

ACAT TATGGC 

2520 

ATCGAGAGCG 

AAGCAACAAC 

GATATTTCGT 

G GAACAC T CA 

GATATGAAGG 

GTTTAGTATG 

2580 

ATAATGGCAA 

CACTTTCGAA 

ACTTGGATTC 

TTTGACAGTG 

AAGCAAATCA 

AGTACTCTCC 

2640 

ACTGGAAAGA 

GGATTACGTT 

TGGTGCTCTT 

TTAAGTAACA 

TTCTAAATAA 

GGATGCAGAC 

2700 

AATGAATCAG 

AGCCCCTAGC 

GGGAGAAGAA 

GAGATAAGCA 

AGAGAATTAT 

CAAGCTTGGA 

2760 

CATTCCAAGG 

AGACTGCAGC 

CAAAGCTGCC 

AAAACAATTG 

TATTCTTGGG 

GTTCAACGAA 

2820 

GAGAG G GAG G 

TTCCATCACT 

GTGTAAAAGC 

GTATTTGATG 

CAACTTGTTA 

CCTAATGGAA 

2880 
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GAGAAACTAG 

CTTATTCCGG 

AAATGAACAG 

GACATGGTGC 

TTTTGCATCA 

CGAAGTAGAA 

2940 

GTGGAATTCC 

TTGAAAGCAA 

ACGTATAGAG 

AAGCACACT G 

CGACTCTTTT 

GGAATTCGGG 

3000 

GACATCAAGA 

AT GGACAAAC 

AACAACCGCT 

ATGGCCAAGA 

CTGTTGGGAT 

CCCTGCAGCC 

3060 

ATTGGAGCTC 

TGGTGTTAAT 

TGAAGACAAG 

ATCAAGACAA 

GAGGAGTCTT 

AAGGCCTCTC 

3120 

GAAGCAGAGG 

TGTATTTGCC 

AGCTTTGGAT 

ATATTGCAAG 

CATATGGTAT 

AAAGCTGATG 

3180 

GAGAAGGCAG 

AATGA 





3195 


(2) INFORMATION FOR SEQ ID NO:112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1064 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:112: 


Met 

Asn 

Ser 

Asn 

Gly 

His 

Glu 

Glu 

Glu 

Lys 

Lys 

Leu 

Gly 

Asn 

Gly 

Val 

1 




5 





10 





15 


Val 

Gly 

lie 

Leu 

Ser 

Glu 

Thr 

Val 

Asn 

Lys 

Trp 

Glu 

Arg 

Arg 

Thr 

Pro 




20 





25 





30 



Leu 

Thr 

Pro 

Ser 

His 

Cys 

Ala 

Arg 

Leu 

Leu 

His 

Gly 

Gly 

Lys 

Asp 

Arg 



35 





40 





45 




Thr 

Gly 

lie 

Ser 

Arg 

He 

Val 

Val 

Gin 

Pro 

Ser 

Ala 

Lys 

Arg 

lie 

His 


50 





55 





60 





His 

Asp 

Ala 

Leu 

Tyr 

Glu 

His 

Val 

Gly 

Cys 

Glu 

lie 

Ser 

Asp 

Asp 

Leu 

65 





70 





75 





80 

Ser 

Asp 

Cys 

Gly 

Leu 

lie 

Leu 

Gly 

lie 

Lys 

Gin 

Pro 

Glu 

Leu 

Glu 

Met 





85 





90 





95 


lie 

Leu 

Pro 

Glu 

Arg 

Ala 

Tyr 

Ala 

Phe 

Phe 

Ser 

His 

Thr 

His 

Lys 

Ala 




100 





105 





110 



Gin 

Lys 

Glu 

Asn 

Met 

Pro 

Leu 

Leu 

Asp 

Lys 

lie 

Leu 

Ser 

Glu 

Arg 

Val 



115 





120 





125 




Thr 

Leu 

Cys 

Asp 

Tyr 

Glu 

Leu 

lie 

Val 

Gly 

Asp 

His 

Gly 

Lys 

Arg 

Leu 


130 





135 





140 





Leu 

Ala 

Phe 

Gly 

Lys 

Tyr 

Ala 

Gly 

Arg 

Ala 

Gly 

Leu 

Val 

Asp 

Phe 

Leu 

145 





150 





155 





160 

His 

Gly 

Leu 

Gly 

Gin 

Arg 

Tyr 

Leu 

Ser 

Leu 

Gly 

Tyr 

Ser 

Thr 

Pro 

Phe 





165 





170 





175 
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Leu Ser Leu Gly Ala Ser Tyr Met 
180 

Ala Ala Val lie Ser Val Gly Glu 
195 200 

Leu Gly lie Cys Pro Leu Val Phe 
210 215 

Ser Leu Gly Ala Gin Glu lie Phe 
225 230 

Glu Pro Ser Lys Leu Pro Glu Leu 
245 

Gin Asn Gly lie Ser Thr Lys Arg 
260 

lie Thr Ser Gin Asp Met Val Glu 
275 280 

Asp Lys Ala Asp Tyr Tyr Ala His 
290 295 

His Glu Lys lie Ser Pro Tyr Thr 
305 310 

Trp Glu Lys Arg Phe Pro Cys Leu 
325 

Leu Thr Lys Lys Gly Leu Pro Leu 
340 

Asp lie Gly Gly Ser lie Glu Phe 
355 360 

Ser Pro Phe Phe Arg Phe Asn Pro 
370 375 

Met Asp Gly Asp Gly Val Leu Cys 
385 390 

Glu Phe Ala Lys Glu Ala Ser Gin 
405 

Phe Val Gly Ser Leu Ala Ser Met 
420 

His Leu Lys Arg Ala Cys lie Ser 
435 440 

Tyr Glu Tyr lie Pro Arg Met Arg 
450 455 

Asp Asn lie lie Ala Asn Gly Val 
465 470 


Tyr Ser Ser Leu Ala Ala Ala Lys 
185 190 

Glu lie Ala Ser Gin Gly Leu Pro 
205 

Val Phe Thr Gly Thr Gly Asn Val 
220 

Lys Leu Leu Pro His Thr Phe Val 
235 240 

Phe Val Lys Asp Lys Gly lie Ser 
250 255 

Val Tyr Gin Val Tyr Gly Cys lie 
265 270 

His Lys Asp Pro Ser Lys Ser Phe 
285 

Pro Glu His Tyr Asn Pro Val Phe 
300 

Ser Val Leu Val Asn Cys Met Tyr 
315 320 

Leu Ser Thr Lys Gin Leu Gin Asp 
330 335 

Val Gly lie Cys Asp lie Thr Cys 
345 350 

Val Asn Arg Ala Thr Leu lie Asp 
365 

Ser Asn Asn Ser Tyr Tyr Asp Asp 
380 

Met Ala Val Asp lie Leu Pro Thr 
395 400 

His Phe Gly Asp lie Leu Ser Gly 
410 415 

Thr Glu lie Ser Asp Leu Pro Ala 
425 430 

Tyr Arg Gly Glu Leu Thr Ser Leu 
445 

Lys Ser Asn Pro Glu Glu Ala Gin 
460 

Ser Ser Gin Arg Thr Phe Asn lie 
475 480 
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Leu Val Ser Leu Ser Gly His Leu 
485 

Ala Leu Asp Met lie Glu Ala Ala 
500 

Cys Glu Leu Gly Gin Ser Ala Asp 
515 520 

Val Gly Ala Asp Asp Lys Arg Val 
530 535 

Thr Arg Leu Ala Asn Pro Asn Glu 
545 550 

Ala Asn Lys lie Ser Leu Lys lie 
565 

lie Lys Glu Lys Pro Glu Met Thr 
580 

Gly Ala Gly Arg Val Cys Arg Pro 
595 600 

X 

Arg Thr lie Ser Ser Gin Gin Trp 
610 615 

Ser Glu Glu Lys Thr Asp Val His 
625 630 

Lys Asp Ala Lys Glu Thr Val Glu 
645 

Arg Leu Asp Val Ser Asp Ser Glu 
660 

Val Asp Val Val Leu Ser Leu Leu 
675 680 

Ala Lys Thr Cys lie Glu Leu Lys 
690 695 

Val Asp Asp Glu Thr Ser Met Leu 
705 710 

lie Thr lie Leu Gly Glu Met Gly 
725 

Met Ala Met Lys Met lie Asn Asp 
740 

Lys Ser Phe Thr Ser Tyr Cys Gly 
755 760 

Asn Asn Pro Leu Ala Tyr Lys Phe 
770 775 


Phe Asp Lys Phe Leu lie Asn Glu 
490 495 

Gly Gly Ser Phe His Leu Ala Lys 
505 510 

Ala Glu Ser Tyr Ser Glu Leu Glu 
525 

Leu Asp Gin lie lie Asp Ser Leu 
540 

Asp Tyr lie Ser Pro His Arg Glu 
555 560 

Gly Lys Val Gin Gin Glu Asn Glu 
570 575 

Lys Lys Ser Gly Val Leu lie Leu 
585, 590 

Ala Ala Asp Phe Leu Ala Ser Val 
605 

Tyr Lys Thr Tyr Phe Gly Ala Asp 
620 

Val lie Val Ala Ser Leu Tyr Leu 
635 640 

Gly lie Ser Asp Val Glu Ala Val 
650 655 

Ser Leu Leu Lys Tyr Val Ser Gin 
665 670 

Pro Ala Ser Cys His Ala Val Val 
685 

Lys His Leu Val Thr Ala Ser Tyr 
700 

His Glu Lys Ala Lys Ser Ala Gly 
715 720 

Leu Asp Pro Gly lie Asp His Met 
730 735 

Ala His lie Lys Lys Gly Lys Val 
745 750 

Gly Leu Pro Ser Pro Ala Ala Ala 
765 

Ser Trp Asn Pro Ala Gly Ala lie 
780 
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Arg Ala Gly Gin Asn Pro Ala Lys Tyr Lys Ser Asn Gly Asp lie lie 

785 790 795 800 

His Val Asp Gly Lys Asn Leu Tyr Asp Ser Ala Ala Arg Phe Arg Val 

805 810 815 

Pro Asn Leu Pro Ala Phe Ala Leu Glu Cys Phe Pro Asn Arg Asp Ser 
820 825 830 

Leu Val Tyr Gly Glu His Tyr Gly lie Glu Ser Glu Ala Thr Thr lie 
835 840 845 

Phe Arg Gly Thr Leu Arg Tyr Glu Gly Phe Ser Met lie Met Ala Thr 
850 855 860 

Leu Ser Lys Leu Gly Phe Phe Asp Ser Glu Ala Asn Gin Val Leu Ser 

865 870 875 880 

Thr Gly Lys Arg lie Thr Phe Gly Ala Leu Leu Ser Asn lie Leu Asn 

885 890 895 

Lys Asp Ala Asp Asn Glu Ser Glu Pro Leu Ala Gly Glu Glu Glu lie 
900 905 910 

Ser Lys Arg lie lie Lys Leu Gly His Ser Lys Glu Thr Ala Ala Lys 
915 920 925 

Ala Ala Lys Thr lie Val Phe Leu Gly Phe Asn Glu Glu Arg Glu Val 
930 935 940 

Pro Ser Leu Cys Lys Ser Val Phe Asp Ala Thr Cys Tyr Leu Met Glu 

945 950 955 960 

Glu Lys Leu Ala Tyr Ser Gly Asn Glu Gin Asp Met Val Leu Leu His 

965 970 975 

His Glu Val Glu Val Glu Phe Leu Glu Ser Lys Arg lie Glu Lys His 
980 985 990 

Thr Ala Thr Leu Leu Glu Phe Gly Asp lie Lys Asn Gly Gin Thr Thr 
995 1000 1005 

Thr Ala Met Ala Lys Thr Val Gly He Pro Ala Ala lie Gly Ala Leu 
1010 1015 1020 

Val Leu lie Glu Asp Lys lie Lys Thr Arg Gly Val Leu Arg Pro Leu 

1025 1030 1035 1040 

Glu Ala Glu Val Tyr Leu Pro Ala Leu Asp lie Leu Gin Ala Tyr Gly 

1045 1050 1055 

lie Lys Leu Met Glu Lys Ala Glu 
1060 
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(2) INFORMATION FOR SEQ ID NO:113: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


FEATURE: 



(A) 

NAME/KEY: 

modified 

base 

(B) 

LOCATION: 

6 


(D) 

OTHER INFORMATION: 

/mod_base=i 

FEATURE: 



(A) 

NAME/KEY: 

modif ied_ 

_base 

(B) 

LOCATION: 

12 


(D) 

OTHER INFORMATION: 

/mod base=i 


(ix) FEATURE: 

(A) NAME/KEY: modified_base 

(B) LOCATION: 21 

(D) OTHER INFORMATION: /mod_base=i 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:113: 

TTYTCNCAYA CNCAYAARGC NCA 23 

(2) INFORMATION FOR SEQ ID NO:114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:114: 

TTYTCCCART ACATRCARTT 20 

(2) INFORMATION FOR SEQ ID NO:115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 619 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:115: 

GAAAACATGC CTTTGCTGGA TAAGATTCTA GCTGAGAGGG CATCGTTATA TGACTATGAA 60 

TTAATTGTTG GGGACACTGG GAAAAGGTTA CTTGCATTTG GAAAATTCGC TGGTAGGGCT 120 
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GGAATGATCG 

ACTTTTTGCG 

CGGATTAGGA 

CAGCGGTTTT 

TAAGTCTTGG 

ATATTCAACA 

180 

CCTTTCTTGT 

CACTTGGATC 

ATCTTACATG 

TACCCTTCCC 

TGGCTGCTGC 

TAAGGCTGCT 

240 

GTGATTTCTG 

TTGGTGAAAA 

ATTGCGACGC 

AGGGATTGCC 

ATTGGGGATT 

TGTCCCCTGG 

300 

TTTGTTTATT 

TACTGGTTCA 

GGAAATGTTT 

GTTCTGGTGC 

ACAGGAGATA 

TTTAAGCTTC 

360 

TTCCTCATAC 

ctttgttgat 

CCATCTAAAC 

TACGCGACCT 

ACATAGAACG 

GACCCAGATC 

420 

AACCAAGGCA 

TGCTTCAAAA 

AGAGTTTTCC 

AAGTTTATGG 

TTGTGTTGTG 

ACTGCCCAAG 

480 

ACATGGTTGA 

ACCCAAAGAT 

CACGTGATAG 

TGTTTGACAA 

AGCAGACTAC 

TATGCACATC 

540 

CTGAGCATTA 

CAATCCCACT 

TTCCATGAAA 

AAATAGCACC 

ATATGCATCT 

gttattgtca 

600 

ATTGCATGTA 

TTGGGAAAA 
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(2) INFORMATION 

FOR SEQ ID 

NO:116: 





(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 




(ii) MOLECULE TYPE: 

DNA (genomic) 




(xi) SEQUENCE DESCRIPTION: SEQ 

ID NO:116: 



GAGAATATGC 

CACTGTTAGA 

CAAGATCCTT 

GAAGAAAGGG 

TGTCCTTGTT 

TGATTATGAG 

60 

CTAATTGTTG 

GAGATGATGG 

GAAAAGAT CA 

CTAGCATTTG 

GGAAATTTGC 

TGGTAGAGCT 

120 

GGACTGATAG 

ATTTCTTACA 

TGGTCTCGGA 

CAGCGATATT 

TGAGCCTTGG 

ATACTCCACT 

180 

CCATTTCTCT 

CTCTGGGACA 

TCTCATATGT 

TCCTTCGCTC 

GCTGCAGCCA 

AGGCTGCAGT 

240 

CATTGTCGTT 

GCAGAAGAGA 

TAGCAACATT 

TGGACTTCCA 

TCCGGAATTT 

GTCCGATAGT 

300 

GTTTGTGTTC 

ACTGGAGTTG 

GAAACGTCTC 

TCAGGGTGCG 

CAGGAGATAT 

TCAAGTTATT 

360 

GCCCCATACC 

TTTGTTGATG 

CTGAGAAGCT 

TCCCGAAATT 

TTTCAGGCCA 

GGAATCTGTC 

420 

TAAGCAATCT 

CAGTCGACCA 

AGAGAGTATT 

TCAACTTTAT 

GGTTGTGTTG 

TGACCTCTAG 

480 

AGACATAG T T 

TCTCACAAGG 

ATCCCACCAG 

ACAATTTGAC 

AAAGGTGACT 

ATTATGCTCA 

540 

TCCAGAACAC 

TACACCCCTG 

TTTTTCATGA 

AAGAATTGCT 

CCATATGCAT 

CTGTCATCGT 

600 

AAACTGCATG 

TATTGGGAAA 
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(2) INFORMATION FOR SEQ ID NO:117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 206 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:117: 

Glu Asn Met Pro Leu Leu Asp Lys lie Leu Ala Glu Arg Ala Ser Leu 
15 10 15 

Tyr Asp Tyr Glu Leu lie Val Gly Asp Thr Gly Lys Arg Leu Leu Ala 

20 25 30 

Phe Gly Lys Phe Ala Gly Arg Ala Gly Met lie Asp Phe Leu Arg Gly 

35 40 45 

Leu Gly Gin Arg Phe Leu Ser Leu Gly Tyr Ser Thr Pro Phe Leu Ser 

50 55 60 

Leu Gly Ser Ser Tyr Met Tyr Pro Ser Leu Ala Ala Ala Lys Ala Ala 

65 70 75 80 

Val lie Ser Val Gly Glu Xaa lie Ala Thr Gin Gly Leu Pro Leu Gly 

85 90 95 

lie Cys Pro Leu Val Cys Leu Phe Thr Gly Ser Gly Asn Val Cys Ser 

100 105 110 

Gly Ala Gin Glu lie Phe Lys Leu Leu Pro His Thr Phe Val Asp Pro 

115 120 125 

Ser Lys Leu Arg Asp Leu His Arg Thr Asp Pro Asp Gin Pro Arg His 

130 135 140 

Ala Ser Lys Arg Val Phe Gin Val Tyr Gly Cys Val Val Thr Ala Gin 

145 150 155 160 

Asp Met Val Glu Pro Lys Asp His Val lie Val Phe Asp Lys Ala Asp 

165 170 175 

Tyr Tyr Ala His Pro Glu His Tyr Asn Pro Thr Phe His Glu Lys lie 

180 185 190 

Ala Pro Tyr Ala Ser Val lie Val Asn Cys Met Tyr Trp Glu 

195 200 205 

(2) INFORMATION FOR SEQ ID NO:118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:118: 
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Glu Asn Met Pro Leu Leu Asp Lys lie Leu Glu Glu Arg Val Ser Leu 
15 10 15 

Phe Asp Tyr Glu Leu lie Val Gly Asp Asp Gly Lys Arg Ser Leu Ala 
20 25 30 

Phe Gly Lys Phe Ala Gly Arg Ala Gly Leu lie Asp Phe Leu His Gly 
35 40 45 

Leu Gly Gin Arg Tyr Leu Ser Leu Gly Tyr Ser Thr Pro Phe Leu Ser 
50 55 60 

Leu Gly Xaa Ser His Met Xaa Pro Ser Leu Ala Ala Ala Lys Ala Ala 

65 70 75 80 

Val lie Val Val Ala Glu Glu lie Ala Thr Phe Gly Leu Pro Ser Gly 

85 90 95 

lie Cys Pro lie Val Phe Val Phe Thr Gly Val Gly Asn Val Ser Gin 
100 105 110 

Gly Ala Gin Glu lie Phe Lys Leu Leu Pro His Thr Phe Val Asp Ala 
115 120 125 

Glu Lys Leu Pro Glu lie Phe Gin Ala Arg Asn Leu Ser Lys Gin Ser 
130 135 140 

Gin Ser Thr Lys Arg Val Phe Gin Leu Tyr Gly Cys Val Val Thr Ser 

145 150 155 160 

Arg Asp lie Val Ser His Lys Asp Pro Thr Arg Gin Phe Asp Lys Gly 

165 170 175 

Asp Tyr Tyr Ala His Pro Glu His Tyr Thr Pro Val Phe His Glu Arg 
180 185 190 

lie Ala Pro Tyr Ala Ser Val lie Val Asn Cys Met Tyr Trp Glu 
195 200 205 

(2) INFORMATION FOR SEQ ID NO:119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2582 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Glycine max 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3..2357 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:119: 

TTGAACCCAA AGATCACGTG ATAGTGTTTG ACAAAGCAGA CTACTATTCA CACCCTGAGC 60 
ATTACAATCC CACTTTCCAT GAAAAAATAG CACCATATGC ATCTGTTATT GTCAATTGCA 120 

TGTATTGGGA GAAAAGATTT CCTCAATTGC CGAGCTATAA GCAGATGCAA GACTTAATGG 180 

GCCGGGGGAG CCCCCTTGTT GGAATAGCTG ACATAACGTG TGATATAGGG GGTTCAATTG 240 

AGTTTGTTAA CCGCGGTACT TCAATTGATT CACCCTTCTT CAGATATGAT CCCTTAACAA 300 

ATTCCTACCA TGATGATATG GAGGGGAATG GAGTGATATG CTTAGCTGTT GACATTCTTC 360 

CAACAGAATT TGCAAAGGAG GCTTCCCAAC ATTTTGGAAA CATACTTTCC CAATTTGTTG 420 

TAAATTTGGC TTCTGCTACA GACATTACAA AGTTGCCTGC TCACTTAAGG AGAGCTTGCA 4 80 

TAGCCCATAA AGGAGTGCTA ACCTCCTTAT ATGATTATAT CCCACGCATG CGGAGTTCTG 540 

AT T C AG AG G A AGTATCAGAA AAC G C AG AAA ATTCTCTATC CAACAAAAGG AAGTACAATA 600 

TATCGGTGTC TCTGAGTGGT CACTTATTTG ATCAGTTTCT GATAAATGAG GCCTTAGATA 660 
TTATTGAAGC TGCAGGAGGC TCCTTCCACT TAGTCAACTG CCATGTGGGT CAGAGCATTG 720 
AAGCCGTATC ATTCTCTGAA CTTGAAGTTG GTGCAGATAA CAGGGCTGTT CTGGATCAAA 780 
TCATTGATTC TTTAACTGCT ATTGCTAGTC CAACTGAACA TGATAGATTT TCAAATCAAG 840 
ATTCAAGTAA AATTTCACTT AAGCTTGGTA AAGTTGAAGA GAATGGCATA GAGAAGGAAT 900 
CTGACCCCAG AAAGAAGGCT GCGGTTTTAA TTCTTGGAGC TGGTCGGGTC TGTCAACCAG 960 
CTGCTGAAAT GTTATCATCA TTTGGAAGGC CATCATCGAG CCAATGGTAT AAAACATTGT 1020 
TGGAAGATGA TTTTGAATGT CAAACTGATG TAGAAGTCAT TGTGGGATCT CTGTACCTGA 108 0 
AGGATGCAGA GCAGACTGTT GAGGGCATTC CAAATGTAAC CGGAATTCAG CTTGATGTGA 1140 
TGGATCGTGC CAATTTGTGT AAGTACATTT CACAGGTTGA CGTTGTTATA AGTTTGCTGC 1200 
CCCCAAGTTG TCATATTATT GTAGCAAATG CTTGCATTGA GCTGAAAAAA CATCTTGTCA 1260 
CTGCTAGCTA TGTTGATAGC TCCATGTCAA TGCTAAATGA TAAGGCTAAA GATGCTGGCA 1320 
TAACAATTCT TGGAGAGATG GGCTTGGACC CAGGAATTGG TCATATGATG GCAATGAAGA 1380 
TGATCAACCA AGCACATGTG AGGAAGGGGA AAATAAAGTC TTTCACTTCT TATTGTGGTG 1440 
GACTTCCATC TCCTGAAGCT GCTAACAATC CATTAGCATA TAAATTCAGT TGGAATCCTG 1500 
CAGGAGCCAT CCGAGCTGGG CGCAATCCTG CCACCTACAA ATGGGGTGGT GAAACTGTAC 1560 
ATATTGATGG GGACGATCTT TATGATTCGG CTACAAGACT AAGGCTACCG GACCTTCCTG 1620 
CTTTTGCTTT GGAATGTCTC CCAAATCGCA ATTCATTACT TTATGGGGAT TTGTATGGAA 1680 
TAACTGAAGC ATCAACCATT TTCCGTGGAA CCCTCCGCTA TGAAGGATTT AGTGAGATCA 1740 
TGGGGACACT GTCTAGGATT AGCTTATTTA ACAATGAAGC CCATTCGTTG CTAATGAATG 1800 
GACAAAGACC AACTTTCAAA AAATTCTTAT TTGAACTTCT CAAAGTTGTT GGTGATAATC 1860 
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CAGATGAACT ATTGATAGGA GAGAATGACA TCATGGAGCA AATATTAATA CAAGGGCACT 1920 

GCAAAGATCA AAGAACGGCA ATGGAGACAG CAAAAACAAT CATTTTCTTG GGACTTCTTG 1980 

ACCAAACTGA AATCCCTGCT TCCTGCAAAA GTGCTTTTGA TGTTGCTTGT TTCCGCATGG 2040 

AGGAGAGGTT ATCATACACC AGCACAGAAA AGGATATGGT GCTTTTGCAT CATGAAGTGG 2100 

AAATAGAATA CCCAGATAGC CAAATTACAG AGAAGCATAG AGCTACTTTA CTTGAATTTG 2160 

GGAAGACTCT TGATGAAAAA ACCACAACTG CCATGGCCCT TACTGTTGGT ATTCCAGCTG 2220 

CTGTTGGAGC TTTGCTTTTA TTGACAAACA AAATTCAGAC AAGAGGAGTC TTAAGGCCTA 2280 

TCGAACCTGA AGTATACAAT CCAGCACTGG ATATTATAGA AGCTTATGGG ATCAAGTTGA 2340 

TAGAGAAGAC CGAGTAATTT GCATYTATGA ATTGATGTAT AGGTGTACAT TAATGTACAC 2400 

CATGCAATGT TTGATTTGAA TAAGATAAAA TATAATAATT ACTGCAGTCA TGGAATTGCA 2460 

ACTGCCATTC TATGCAACTG TCAGAAATGG ACCACACGGT ACCAGCATAG TTAAAACACT 2520 

TAGGCAGATA CCAATTTCAA TTGCAGCAGT ACAATCCAAC CAGTTATGAA GTATGGTTCT 2580 

AG 2582 

(2) INFORMATION FOR SEQ ID NO:120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3265 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zea mays 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3..3071 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:120: 

ATTGTGCCCG CCTTCTGCTA GGAGGAGGCA AGAACGGACC TCGAGTAAAC CGGATTATTG 60 

TGCAGCCAAG CACAAGGAGG ATCCATCATG ACGCTCAGTA TGAGGATGCA GGATGCGAGA 120 

TTTCAGAAGA CCTGTCAGAA TGCGGCCTTA TCATAGGCAT CAAACAACCC AAGCTGCAGA 180 

TGATTCTTTC AGATAGAGCG TACGCTTTCT TTTCACACAC ACACAAAGCC CAAAAAGAGA 24 0 

ATATGCCACT GTTAGACAAG ATCCTTGAAG AAAGGGTGTC CTTGTTTGAT TATGAGCTAA 300 

TTGTTGGAGA TGATGGGAAA AGATCACTAG CATTTGGGAA ATTTGCTGGT AGAGCTGGAC 360 

TGATAGATTT CTTACATGGT CTCGGACAGC GATATTTGAG CCTTGGATAC TCGACTCCAT 420 

TTCTCTCTCT GGGACAATCT CATATGTATC CTTCGCTCGC TGCAGCCAAG GCTGCAGTCA 480 
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TTGTCGTTGC 

AGAAGAGATA 

GCAACATTTG 

GACTTCCATC 

CGGAATTTGT 

CCGATAGTGT 

540 

TTGTGTTCAC 

TGGAGTTGGA 

AACGTCTCTC 

AGGGTGCGCA 

GGAGATATTC 

AAGTTATTGC 

600 

CCCATACCTT 

TGTTGATGCT 

GAGAAGCTTC 

CCGAAATTTT 

TCAGGCCAGG 

AATCTGTCTA 

660 

AGCAATCTCA 

GTCGACCAAG 

AGAG TAT T T C 

AACTTTATGG 

TTGTGTTGTG 

ACCTCTAGAG 

720 

ACATAGTTTC 

T CACAAGGAT 

CCCACCAGAC 

AATTTGACAA 

AGGTGACTAT 

TATGCTCATC 

780 

CAGAACACTA 

CACCCCTGTT 

TTTCATGAAA 

GAATTGCTCC 

ATATGCATCT 

GTCATCGTAA 

840 

ACTGTATGTA 

TTGGGAGAAG 

AGGTTTCCAC 

CAT TAC TAAA 

TATGGATCAG 

TTACAGCAAT 

900 

T GAT G GAG AC 

TGGTTGTCCT 

TTAGTCGGCG 

TTTGTGACAT 

AACTTGTGAT 

ATTGGAGGTT 

960 

CCATTGAATT 

TATCAACAAG 

AG TACAT CAA 

TAGAGAGGCC 

TTTCTTTCGG 

TATGATCCTT 

1020 

CTAAGAATTC 

ATACCATGAT 

GATATGGAAG 

GTGCCGGAGT 

GGTCTGCTTG 

GCTGTTGACA 

1080 

TTCTCCCTAC 

AGAATTCTCT 

AAAGAGGCCT 

CCCAACATTT 

T GGAAACATA 

CTATCTAGAC 

1140 

TTGTTGCTAG 

TTTGGCCTCA 

GTGAAGCAAC 

CGGCAGAACT 

TCCTTCCTAC 

TTGAGAAGAG 

1200 

CTTGCATTGC 

ACATGCTGGC 

AGATTAACTC 

CTTTGTATGA 

ATATATCCCT 

AG GAT GAGAA 

1260 

ATACTATGAT 

AGATTTGGCA 

CCCGCAAAAA 

CAAAT C CAT T 

GCCTGACAAG 

AAGTATAGCA 

1320 

CCCTGGTATC 

TCTCAGTGGG 

CACCTATTTG 

ATAAGTTCCT 

TATAAATGAA 

GCTTTGGACA 

1380 

T CAT T GAG AC 

AGCTGGAGGT 

TCATTTCACT 

TGGTTAGATG 

TGAAGTTGGA 

CAAAGCACGG 

1440 

ATGATATGTC 

ATAC T CAGAG 

CTTGAAGTAG 

GAG CAGAT GA 

TACTGCCACA 

TTGGATAAAA 

1500 

TTATTGATTC 

CTTGACTTCT 

TTAGCTAATG 

AACATGGTGG 

AGAT CAC GAT 

GCCGGGCAAG 

1560 

AAATTGAATT 

AGCTCTGAAG 

ATAGGAAAAG 

TCAATGAGTA 

TGAAACTGAC 

GTCACAATTG 

1620 

ATAAAGGAGG 

GCCAAAGATT 

TTAATTCTTG 

GAGCTGGAAG 

AGTCTGTCGG 

CCAGCTGCTG 

1680 

AGTTTCTGGC 

ATCTTACCCA 

GACATATGTA 

CCTATGGTGT 

T GAT GAC CAT 

GATGCAGATC 

1740 

AAATTCATGT 

TATCGTGGCA 

tctttgtatc 

AAAAAGATGC 

AGAAGAGACA 

GTTGATGGTA 

1800 

TTGAAAATAC 

AACTGCTACC 

CAGCTTGATG 

TTGCTGATAT 

TGGAAGCCTT 

TCAGATCTTG 

1860 

TTTCTCAGGT 

TGAGGTTGTA 

ATTAGCTTGC 

TGCCTGCTAG 

TTTTCATGCT 

GCCATTGCAG 

1920 

GAGTATGCAT 

AGAGTTGAAG 

AAGCACATGG 

TAACGGCAAG 

CTATGTTGAT 

GAATCCATGT 

1980 

CAAACTTGAG 

CCAAGCTGCC 

AAAGATGCAG 

GTGTAACTAT 

ACTTTGTGAA 

ATGGGCCTAG 

2040 

ATCCTGGCAT 

AGATCACTTG 

ATGTCAATGA 

AGATGATTGA 

TGAAGCTCAT 

GCACGAAAGG 

2100 

GAAAAATAAA 

GGCATTTACA 

TCTTACTGTG 

GTGGATTGCC 

ATCTCCAGCT 

GCAGCAAACA 

2160 

ATCCGCTTGC 

CTATAAATTC 

AGTTGGAACC 

CAGCTGGTGC 

ACTCCGGTCA 

GGGAAAAATC 

2220 

CTGCAGTCTA 

CAAATTTCTT 

GGTGAGACGA 

TCCATGTAGA 

TGGTCATAAC 

TTGTATGAAT 

2280 

CAGCAAAGAG 

GCTCAGACTA 

CGAGAGCTTC 

CAGCTTTTGC 

TCTGGAACAC 

TTGCCAAATC 

2340 

GGAATTCCTT 

GATATATGGT 

GACCTTTATG 

GTATCTCCAA 

AGAAGCATCC 

ACCATATATA 

2400 
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GGGCTACTYT 

TCGTTACGAA 

GGTTTTAGTG 

AGATTATGGT 

AACCCTTTCC 

AAAACTGGGT 

2460 

TCTTTGATGC 

TGCAAATCAT 

CCACTGCTGC 

AAGATAC TAG 

TCGTCCAACA 

TATAAGGGTT 

2520 

TCCTTGATGA 

ACTACTGAAT 

AATATCTCCA 

CAATTAACAC 

GGACTTAGAT 

ATTGAAGCTT 

2580 

CTGGTGGATA 

CGATGATGAC 

CTGATTGCCA 

GACTGTTGAA 

GCTCGGGTGT 

TGCAAAAATA 

2640 

AGGAAATAGC 

TGTTAAGACA 

GTCAAAACCA 

TCAAGTTCTT 

GGGACTACAT 

GAAGAGACT C 

2700 

AAATACCTAA 

GGGTTGTTCG 

AGCCCATTTG 

ATGTGATTTG 

CCAGCGAATG 

GAACAGAGGA 

2760 

TGGCCTATGG 

CCACAATGAG 

CAAGACATGG 

TACTGCTCCA 

CCACGAAGTC 

GAGGTGGAAT 

2820 

ACCCGGACGG 

GCAACCCGCC 

GAAAAGCACC 

AAGCGACGCT 

ACTGGAGTTC 

GGGAAGGTTG 

2880 

AAAATGGCAG 

GTCCACCACT 

GCCATGGCGC 

TGACCGTCGG 

CATTCCAGCA 

GCAATAGGGG 

2940 

CCCTGCTATT 

GCTAAAGAAT 

AAGGTCCAGA 

CGAAAGGAGT 

GATCAGGCCT 

CTGCAACCGG 

3000 

AAATCTACGT 

TCCAGCATTG 

GAGATCTTGG 

AGTCGTCGGG 

CATCAAGCTG 

GTTGAGAAAG 

3060 

TGGAGACTTG 

AAAGTTCCCT 

GATACACAGA 

TAAAGATAGT 

AT G AT AT AGC 

AGGGCACATG 

3120 

TATCTTTTGT 

ATTAACTCCG 

TTCTGGAATA 

TATATTTGTG 

AACTAAAATG 

TGACAAATAA 

3180 

AAAGAACGGG 

TGGAGTATAT 

TGTAAGAGAC 

GGCAAAGAAA 

CCTCTGTATA 

TATGACCTGT 

3240 

C GATAT CAAA 

TAATGCCGAT 

CAGTT 
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(2) INFORMATION FOR SEQ ID NO:121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 784 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Glycine max 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:121: 

Glu Pro Lys Asp His Val lie Val Phe Asp Lys Ala Asp Tyr Tyr Ser 
15 10 15 

His Pro Glu His Tyr Asn Pro Thr Phe His Glu Lys lie Ala Pro Tyr 
20 25 30 

Ala Ser Val lie Val Asn Cys Met Tyr Trp Glu Lys Arg Phe Pro Gin 
35 40 45 

Leu Pro Ser Tyr Lys Gin Met Gin Asp Leu Met Gly Arg Gly Ser Pro 
50 55 60 

Leu Val Gly lie Ala Asp lie Thr Cys Asp lie Gly Gly Ser lie Glu 
65 70 75 80 
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Phe Val Asn Arg Gly Thr Ser lie 
85 

Pro Leu Thr Asn Ser Tyr His Asp 
100 

Cys Leu Ala Val Asp lie Leu Pro 
115 120 

Gin His Phe Gly Asn lie Leu Ser 
130 135 

Ala Thr Asp lie Thr Lys Leu Pro 
145 150 

Ala His Lys Gly Val Leu Thr Ser 
165 

Arg Ser Ser Asp Ser Glu Glu Val 
180 

Ser Asn Lys Arg Lys Tyr Asn lie 
195 200 

Phe Asp Gin Phe Leu lie Asn Glu 
210 215 

Gly Gly Ser Phe His Leu Val Asn 
225 230 

Ala Val Ser Phe Ser Glu Leu Glu 
245 

Leu Asp Gin lie lie Asp Ser Leu 
260 

His Asp Arg Phe Ser Asn Gin Asp 
275 280 

Gly Lys Val Glu Glu Asn Gly lie 
290 295 

Lys Ala Ala Val Leu lie Leu Gly 
305 310 

Ala Glu Met Leu Ser Ser Phe Gly 
325 

Lys Thr Leu Leu Glu Asp Asp Phe 
340 

lie Val Gly Ser Leu Tyr Leu Lys 
355 360 

lie Pro Asn Val Thr Gly lie Gin 
370 375 

Leu Cys Lys Tyr lie Ser Gin Val 
385 390 

Pro Ser Cys His lie lie Val Ala 
405 

His Leu Val Thr Ala Ser Tyr Val 
420 


Asp Ser Pro Phe Phe Arg Tyr Asp 
90 95 

Asp Met Glu Gly Asn Gly Val lie 
105 110 

Thr Glu Phe Ala Lys Glu Ala Ser 
125 

Gin Phe Val Val Asn Leu Ala Ser 
140 

Ala His Leu Arg Arg Ala Cys lie 
155 160 

Leu Tyr Asp Tyr lie Pro Arg Met 
170 175 

Ser Glu Asn Ala Glu Asn Ser Leu 
185 190 

Ser Val Ser Leu Ser Gly His Leu 
205 

Ala Leu Asp lie lie Glu Ala Ala 
220 

Cys His Val Gly Gin Ser lie Glu 
235 240 

Val Gly Ala Asp Asn Arg Ala Val 
250 255 

Thr Ala lie Ala Ser Pro Thr Glu 
265 270 

Ser Ser Lys lie Ser Leu Lys Leu 
285 

Glu Lys Glu Ser Asp Pro Arg Lys 
300 

Ala Gly Arg Val Cys Gin Pro Ala 
315 320 

Arg Pro Ser Ser Ser Gin Trp Tyr 
330 335 

Glu Cys Gin Thr Asp Val Glu Val 
345 350 

Asp Ala Glu Gin Thr Val Glu Gly 
365 

Leu Asp Val Met Asp Arg Ala Asn 
380 

Asp Val Val lie Ser Leu Leu Pro 
395 400 

Asn Ala Cys lie Glu Leu Lys Lys 
410 415 

Asp Ser Ser Met Ser Met Leu Asn 
425 430 
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Asp Lys Ala Lys Asp Ala Gly lie Thr lie Leu Gly Glu Met Gly Leu 
435 440 445 

Asp Pro Gly lie Gly His Met Met Ala Met Lys Met lie Asn Gin Ala 
450 455 460 

His Val Arg Lys Gly Lys lie Lys Ser Phe Thr Ser Tyr Cys Gly Gly 

465 470 475 480 

Leu Pro Ser Pro Glu Ala Ala Asn Asn Pro Leu Ala Tyr Lys Phe Ser 

485 490 495 

Trp Asn Pro Ala Gly Ala lie Arg Ala Gly Arg Asn Pro Ala Thr Tyr 
500 505 510 

Lys Trp Gly Gly Glu Thr Val His lie Asp Gly Asp Asp Leu Tyr Asp 
515 520 525 

Ser Ala Thr Arg Leu Arg Leu Pro Asp Leu Pro Ala Phe Ala Leu Glu 
530 535 540 

Cys Leu Pro Asn Arg Asn Ser Leu Leu Tyr Gly Asp Leu Tyr Gly lie 

- 545 550 555 560 

Thr Glu Ala Ser Thr lie Phe Arg Gly Thr Leu Arg Tyr Glu Gly Phe 

565 570 575 

Ser Glu lie Met Gly Thr Leu Ser Arg lie Ser Leu Phe Asn Asn Glu 
580 585 590 

Ala His Ser Leu Leu Met Asn Gly Gin Arg Pro Thr Phe Lys Lys Phe 
595 600 605 

Leu Phe Glu Leu Leu Lys Val Val Gly Asp Asn Pro Asp Glu Leu Leu 
610 615 620 

lie Gly Glu Asn Asp lie Met Glu Gin lie Leu lie Gin Gly His Cys 

625 630 635 640 

Lys Asp Gin Arg Thr Ala Met Glu Thr Ala Lys Thr lie lie Phe Leu 

645 650 655 

Gly Leu Leu Asp Gin Thr Glu lie Pro Ala Ser Cys Lys Ser Ala Phe 
660 665 670 

Asp Val Ala Cys Phe Arg Met Glu Glu Arg Leu Ser Tyr Thr Ser Thr 
675 680 685 

Glu Lys Asp Met Val Leu Leu His His Glu Val Glu lie Glu Tyr Pro 
690 695 700 

Asp Ser Gin lie Thr Glu Lys His Arg Ala Thr Leu Leu Glu Phe Gly 

705 710 715 720 

Lys Thr Leu Asp Glu Lys Thr Thr Thr Ala Met Ala Leu Thr Val Gly 

725 730 735 

lie Pro Ala Ala Val Gly Ala Leu Leu Leu Leu Thr Asn Lys lie Gin 
740 745 750 

Thr Arg Gly Val Leu Arg Pro lie Glu Pro Glu Val Tyr Asn Pro Ala 
755 760 765 

Leu Asp lie lie Glu Ala Tyr Gly lie Lys Leu lie Glu Lys Thr Glu 
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770 


775 


780 


(2) INFORMATION'FOR SEQ ID NO:122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1022 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zea mays 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:122: 


Cys 

1 

Ala 

Arg 

Leu 

Leu 

5 

Leu 

Gly 

Gly 

Gly 

Lys 

10 

Asn 

Gly 

Pro 

Arg 

Val 

15 

Asn 

Arg 

lie 

lie 

Val 

20 

Gin 

Pro 

Ser 

Thr 

Arg 

25 

Arg 

lie 

His 

His 

Asp 

30 

Ala 

Gin 

Tyr 

Glu 

Asp 

35 

Ala 

Gly 

Cys 

Glu 

lie 

40 

Ser 

Glu 

Asp 

Leu 

Ser 

45 

Glu 

Cys 

Gly 

Leu 

lie 

50 

lie 

Gly 

lie 

Lys 

Gin 

55 

Pro 

Lys 

Leu 

Gin 

Met 

60 

lie 

Leu 

Ser 

Asp 

Arg 

65 

Ala 

Tyr 

Ala 

Phe 

Phe 

70 

Ser 

His 

Thr 

His 

Lys 

75 

Ala 

Gin 

Lys 

Glu 

Asn 

80 

Met 

Pro 

Leu 

Leu 

Asp 

85 

Lys 

lie 

Leu 

Glu 

Glu 

90 

Arg 

Val 

Ser 

Leu 

Phe 

95 

Asp 

Tyr 

Glu 

Leu 

lie 

100 

Val 

Gly 

Asp 

Asp 

Gly 

105 

Lys 

Arg 

Ser 

Leu 

Ala 

110 

Phe 

Gly 

Lys 

Phe 

Ala 

115 

Gly 

Arg 

Ala 

Gly 

Leu 

120 

lie 

Asp 

Phe 

Leu 

His 

125 

Gly 

Leu 

Gly 

Gin 

Arg 

130 

Tyr 

Leu 

Ser 

Leu 

Gly 

135 

Tyr 

Ser 

Thr 

Pro 

Phe 

140 

Leu 

Ser 

Leu 

Gly 

Gin 

145 

Ser 

His 

Met 

Tyr 

Pro 

150 

Ser 

Leu 

Ala 

Ala 

Ala 

155 

Lys 

Ala 

Ala 

Val 

lie 

160 

Val 

Val 

Ala 

Glu 

Glu 

165 

lie 

Ala 

Thr 

Phe 

Gly 

170 

Leu 

Pro 

Ser 

Gly 

lie 

175 

Cys 

Pro 

He 

Val 

Phe 

180 

Val 

Phe 

Thr 

Gly 

Val 

185 

Gly 

Asn 

Val 

Ser 

Gin 

190 

Gly 

Ala 

Gin 

Glu 

lie 

195 

Phe 

Lys 

Leu 

Leu 

Pro 

200 

His 

Thr 

Phe 

Val 

Asp 

205 

Ala 

Glu 

Lys 

Leu 

Pro 

210 

Glu 

lie 

Phe 

Gin 

Ala 

215 

Arg 

Asn 

Leu 

Ser 

Lys 

220 

Gin 

Ser 

Gin 

Ser 

Thr 

225 

Lys 

Arg 

Val 

Phe 

Gin 

230 

Leu 

Tyr 

Gly 

Cys 

Val 

235 

Val 

Thr 

Ser 

Arg 

Asp 

240 

lie 

Val 

Ser 

His 

Lys 

Asp 

Pro 

Thr 

Arg 

Gin 

Phe 

Asp 

Lys 

Gly Asp 

Tyr 
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245 250 255 


Tyr 

Ala 

His 

Pro 

260 

Glu 

His 

Tyr 

Thr 

Pro 

265 

Val 

Phe 

His 

Glu 

Arg 

270 

lie 

Ala 

Pro 

Tyr 

Ala 

275 

Ser 

Val 

lie 

Val 

Asn 

280 

Cys 

Met 

Tyr 

Trp 

Glu 

285 

Lys 

Arg 

Phe 

Pro 

Pro 

290 

Leu 

Leu 

Asn 

Met 

Asp 

295 

Gin 

Leu 

Gin 

Gin 

Leu 

300 

Met 

Glu 

Thr 

Gly 

Cys 

305 

Pro 

Leu 

Val 

Gly 

Val 

310 

Cys 

Asp 

lie 

Thr 

Cys 

315 

Asp 

lie 

Gly 

Gly 

Ser 

320 

He 

Glu 

Phe 

lie 

Asn 

325 

Lys 

Ser 

Thr 

Ser 

lie 

330 

Glu 

Arg 

Pro 

Phe 

Phe 

335 

Arg 

Tyr 

Asp 

Pro 

Ser 

340 

Lys 

Asn 

Ser 

Tyr 

His 

345 

Asp 

Asp 

Met 

Glu 

Gly 

350 

Ala 

Gly 

Val 

Val 

Cys 

355 

Leu 

Ala 

Val 

Asp 

lie 

360 

Leu 

Pro 

Thr 

Glu 

Phe 

365 

Ser 

Lys 

Glu 

Ala 

Ser 

370 

Gin 

His 

Phe 

Gly 

Asn 

375 

lie 

Leu 

Ser 

Arg 

Leu 

380 

Val 

Ala 

Ser 

Leu 

Ala 

385 

Ser 

Val 

Lys 

Gin 

Pro 

390 

Ala 

Glu 

Leu 

Pro 

Ser 

395 

Tyr 

Leu 

Arg 

Arg 

Ala 

400 

Cys 

lie 

Ala 

His 

Ala 

405 

Gly 

Arg 

Leu 

Thr 

Pro 

410 

Leu 

Tyr 

Glu 

Tyr 

lie 

415 

Pro 

Arg 

Met 

Arg 

Asn 

420 

Thr 

Met 

lie 

Asp 

Leu 

425 

Ala 

Pro 

Ala 

Lys 

Thr 

430 

Asn 

Pro 

Leu 

Pro 

Asp 

435 

Lys 

Lys 

Tyr 

Ser 

Thr 

440 

Leu 

Val 

Ser 

Leu 

Ser 

445 

Gly 

His 

Leu 

Phe 

Asp 

450 

Lys 

Phe 

Leu 

lie 

Asn 

455 

Glu 

Ala 

Leu 

Asp 

lie 

460 

lie 

Glu 

Thr 

Ala 

Gly 

465 

Gly 

Ser 

Phe 

His 

Leu 

470 

Val 

Arg 

Cys 

Glu 

Val 

475 

Gly 

Gin 

Ser 

Thr 

Asp 

480 

Asp 

Met 

Ser 

Tyr 

Ser 

485 

Glu 

Leu 

Glu 

Val 

Gly 

490 

Ala 

Asp 

Asp 

Thr 

Ala 

495 

Thr 

Leu 

Asp 

Lys 

lie 

500 

lie 

Asp 

Ser 

Leu 

Thr 

505 

Ser 

Leu 

Ala 

Asn 

Glu 

510 

His 

Gly 

Gly 

Asp 

His 

515 

Asp 

Ala 

Gly 

Gin 

Glu 

520 

lie 

Glu 

Leu 

Ala 

Leu 

525 

Lys 

lie 

Gly 

Lys 

Val 

530 

Asn 

Glu 

Tyr 

Glu 

Thr 

535 

Asp 

Val 

Thr 

lie 

Asp 

540 

Lys 

Gly 

Gly 

Pro 

Lys 

545 

lie 

Leu 

lie 

Leu 

Gly 

550 

Ala 

Gly 

Arg 

Val 

Cys 

555 

Arg 

Pro 

Ala 

Ala 

Glu 

560 

Phe 

Leu 

Ala 

Ser 

Tyr 

565 

Pro 

Asp 

lie 

Cys 

Thr 

570 

Tyr 

Gly 

Val 

Asp 

Asp 

575 

His 

Asp 

Ala 

Asp 

Gin 

580 

lie 

His 

Val 

lie 

Val 

585 

Ala 

Ser 

Leu 

Tyr 

Gin 

590 

Lys 

Asp 
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Ala Glu Glu Thr Val 
595 

Asp Val Ala Asp lie 
610 

Val Val lie Ser Leu 
625 

Val Cys lie Glu Leu 
645 

Glu Ser Met Ser Asn 
660 

lie Leu Cys Glu Met 
675 


Met Lys Met lie Asp 
690 


Phe Thr Ser Tyr Cys 
705 


Pro Leu Ala Tyr Lys 
725 

Gly Lys Asn Pro Ala 
740 

Asp Gly His Asn Leu 
755 

Leu Pro Ala Phe Ala 
770 


Tyr Gly Asp Leu Tyr 
785 


Ala Thr Xaa Arg Tyr 
805 

Lys Thr Gly Phe Phe 
820 

Ser Arg Pro Thr Tyr 
835 

Ser Thr lie Asn Thr 
850 


Asp Asp Leu lie Ala 
865 

Glu lie Ala Val Lys 
885 

Glu Glu Thr Gin lie 
900 

Cys Gin Arg Met Glu 
915 

Met Val Leu Leu His 
930 


Asp Gly lie Glu Asn Thr 
600 

Gly Ser Leu Ser Asp Leu 
615 

Leu Pro Ala Ser Phe His 
630 635 

Lys Lys His Met Val Thr 
650 

Leu Ser Gin Ala Ala Lys 
665 

Gly Leu Asp Pro Gly lie 
680 

Glu Ala His Ala Arg Lys 
695 

Gly Gly Leu Pro Ser Pro 
710 715 

Phe Ser Trp Asn Pro Ala 
730 

Val Tyr Lys Phe Leu Gly 
745 

Tyr Glu Ser Ala Lys Arg 
760 

Leu Glu His Leu Pro Asn 
775 

Gly lie Ser Lys Glu Ala 
790 795 

Glu Gly Phe Ser Glu lie 
810 

Asp Ala Ala Asn His Pro 
825 

Lys Gly Phe Leu Asp Glu 
840 

Asp Leu Asp lie Glu Ala 
855 

Arg Leu Leu Lys Leu Gly 
870 875 

Thr Val Lys Thr lie Lys 
890 

Pro Lys Gly Cys Ser Ser 
905 

Gin Arg Met Ala Tyr Gly 
920 

His Glu Val Glu Val Glu 
935 


Thr Ala Thr Gin Leu 
605 


Val Ser Gin Val Glu 
620 


Ala Ala He Ala Gly 
640 

Ala Ser Tyr Val Asp 
655 

Asp Ala Gly Val Thr 
670 

Asp His Leu Met Ser 
685 


Gly Lys lie Lys Ala 
700 


Ala Ala Ala Asn Asn 
720 

Gly Ala Leu Arg Ser 
735 

Glu Thr lie His Val 
750 

Leu Arg Leu Arg Glu 
765 


Arg Asn Ser Leu lie 
780 


Ser Thr lie Tyr Arg 
800 

Met Val Thr Leu Ser 
815 

Leu Leu Gin Asp Thr 
830 

Leu Leu Asn Asn lie 
845 


Ser Gly Gly Tyr Asp 
860 


Cys Cys Lys Asn Lys 
880 

Phe Leu Gly Leu His 
895 

Pro Phe Asp Val lie 
910 

His Asn Glu Gin Asp 
925 


Tyr Pro Asp Gly Gin 
940 
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Pro Ala Glu Lys His Gin Ala Thr Leu Leu Glu Phe Gly Lys Val Glu 

945 950 955 960 

Asn Gly Arg Ser Thr Thr Ala Met Ala Leu Thr Val Gly lie Pro Ala 

965 970 975 

Ala lie Gly Ala Leu Leu Leu Leu Lys Asn Lys Val Gin Thr Lys Gly 
980 985 990 

Val He Arg Pro Leu Gin Pro Glu lie Tyr Val Pro Ala Leu Glu lie 
995 1000 1005 

Leu Glu Ser Ser Gly lie Lys Leu Val Glu Lys Val Glu Thr 
1010 1015 1020 

(2) INFORMATION FOR SEQ ID NO:123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1908 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zea mays 

(ix) FEATURE: 



(A) 

NAME/KEY: 

CDS 





(B) 

LOCATION: 

3..1908 





(xi) SEQUENCE DESCRIPTION: SEQ 

ID NO:123: 



ATTGTGCCCG 

CCTTCTGCTA 

G GAG GAG G CA 

AGAACGGACC 

TCGAGTAAAC 

CGGATTATTG 

60 

TGCAGCCAAG 

CACAAGGAGG 

AT C CAT CAT G 

ACGCTCAGTA 

TGAGGATGCA 

GGATGCGAGA 

120 

TTTCAGAAGA 

CCTGTCAGAA 

TGCGGCCTTA 

TCATAGGCAT 

CAAACAACCC 

AAGCTGCAGA 

180 

TGATTCTTTC 

AGATAGAGCG 

TACGCTTTCT 

T T T C AC AC AC 

ACACAAAGCC 

CAAAAAGAGA 

240 

ATATGCCACT 

GTTAGACAAG 

ATCCTTGAAG 

AAAGGGTGTC 

CTTGTTTGAT 

TATGAGCTAA 

300 

TTGTTGGAGA 

TGATGGGAAA 

AGATCACTAG 

CATTTGGGAA 

ATTTGCTGGT 

AGAGCTGGAC 

360 

TGATAGATTT 

CTTACATGGT 

CTCGGACAGC 

GATATTTGAG 

CCTTGGATAC 

TCGACTCCAT 

420 

TTCTCTCTCT 

GGGACAATCT 

CATATGTATC 

CTTCGCTCGC 

TGCAGCCAAG 

GCTGCAGTCA 

480 

TTGTCGTTGC 

AGAAGAGATA 

GCAACATTTG 

GACTTCCATC 

CGGAATTTGT 

CCGATAGTGT 

540 

TTGTGTTCAC 

TGGAGTTGGA 

AACGTCTCTC 

AGGGTGCGCA 

GGAGATATTC 

AAGTTATTGC 

600 

CCCATACCTT 

TGTTGATGCT 

GAGAAGCTTC 

CCGAAATTTT 

TCAGGCCAGG 

AATCTGTCTA 

660 

AGCAATCTCA 

GTCGACCAAG 

AGAGTATTTC 

AACTTTATGG 

TTGTGTTGTG 

ACCTCTAGAG 

720 

ACATAGTTTC 

TCACAAGGAT 

CCCACCAGAC 

AATTTGACAA 

AGGTGACTAT 

TATGCTCATC 

780 
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840 


CAGAACACTA CACCCCTGTT TTTCATGAAA GAATTGCTCC ATATGCATCT GTCATCGTAA 
ACTGTATGTA TTGGGAGAAG AGGTTTCCAC CATTACTAAA TATGGATCAG TTACAGCAAT 
TGATGGAGAC TGGTTGTCCT TTAGTCGGCG TTTGTGACAT AACTTGTGAT ATTGGAGGTT 
CCATTGAATT TATCAACAAG AGTACATCAA TAGAGAGGCC TTTCTTTCGG TATGATCCTT 
CTAAGAATTC ATACCATGAT GATATGGAAG GTGCCGGAGT GGTCTGCTTG GCTGTTGACA 
TTCTCCCTAC AGAATTCTCT AAAGAGGCCT CCCAACATTT TGGAAACATA CTATCTAGAC 
TTGTTGCTAG TTTGGCCTCA GTGAAGCAAC CGGCAGAACT TCCTTCCTAC TTGAGAAGAG 
CTTGCATTGC ACATGCTGGC AGATTAACTC CTTTGTATGA ATATATCCCT AGGATGAGAA 
ATACTATGAT AGATTTGGCA CCCGCAAAAA CAAATCCATT GCCTGACAAG AAGTATAGCA 
CCCTGGTATC TCTCAGTGGG CACCTATTTG ATAAGTTCCT TATAAATGAA GCTTTGGACA 
TCATTGAGAC AGCTGGAGGT TCATTTCACT TGGTTAGATG TGAAGTTGGA CAAAGCACGG 
ATGATATGTC AT AC T CAGAG CTTGAAGTAG GAGCAGATGA TACTGCCACA TTGGATAAAA 
TTATTGATTC CTTGACTTCT TTAGCTAATG AACATGGTGG AGATCACGAT GCCGGGCAAG 
AAATTGAATT AGCTCTGAAG ATAGGAAAAG TCAATGAGTA TGAAACTGAC GTCACAATTG 
ATAAAGGAGG GCCAAAGATT TTAATTCTTG GAGCTGGAAG AGTCTGTCGG CCAGCTGCTG 
AGTTTCTGGC ATCTTACCCA GACATATGTA CCTATGGTGT TGATGACCAT GATGCAGATC 
AAATTCATGT TATCGTGGCA TCTTTGTATC AAAAAGATGC AGAAGAGACA GTTGATGGTA 
TTGAAAATAC AACTGCTACC CAGCTTGATG TTGCTGATAT TGGAAGCCTT TCAGATCTTG 
TTTCTCAGGT TGAGGTTGTA ATTAGCTTGC TGCCTGCTAG TTTTCATG 
(2) INFORMATION FOR SEQ ID NO:124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 640 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zea mays 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:124: 

Cys Ala Arg Leu Leu Leu Gly Gly Gly Lys Asn Gly Pro Arg Val Asn 

1 5 10 15 

Arg He lie Val Gin Pro Ser Thr Arg Arg lie His His Asp Ala Gin 

20 25 30 


900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1908 
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Tyr 

Glu 

Asp 

35 

Ala 

Gly Cys 

Glu 

lie 

40 

Ser 

Glu 

Asp 

Leu 

Ser 

45 

Glu 

Cys 

Gly 

Leu 

lie 

50 

lie 

Gly 

lie 

Lys 

Gin 

55 

Pro 

Lys 

Leu 

Gin 

Met 

60 

lie 

Leu 

Ser 

Asp 

Arg 

65 

Ala 

Tyr 

Ala 

Phe 

Phe 

70 

Ser 

His 

Thr 

His 

Lys 

75 

Ala 

Gin 

Lys 

Glu 

Asn 

80 

Met 

Pro 

Leu 

Leu 

Asp 

85 

Lys 

lie 

Leu 

Glu 

Glu 

90 

Arg 

Val 

Ser 

Leu 

Phe 

95 

Asp 

Tyr 

Glu 

Leu 

lie 

100 

Val 

Gly Asp Asp 

Gly 

105 

Lys 

Arg 

Ser 

Leu 

Ala 

110 

Phe 

Gly 

Lys 

Phe 

Ala 

115 

Gly 

Arg 

Ala 

Gly 

Leu 

120 

lie 

Asp 

Phe 

Leu 

His 

125 

Gly 

Leu 

Gly 

Gin 

Arg 

130 

Tyr 

Leu 

Ser 

Leu 

Gly 

135 

Tyr 

Ser 

Thr 

Pro 

Phe 

140 

Leu 

Ser 

Leu 

Gly 

Gin 

145 

Ser 

His 

Met 

Tyr 

Pro 

150 

Ser 

Leu 

Ala 

Ala 

Ala 

155 

Lys 

Ala 

Ala 

Val 

lie 

160 

Val 

Val 

Ala 

Glu 

Glu 

165 

lie 

Ala 

Thr 

Phe 

Gly 

170 

Leu 

Pro 

Ser 

Gly 

lie 

175 

Cys 

Pro 

lie 

Val 

Phe 

180 

Val 

Phe 

Thr 

Gly 

Val 

185 

Gly Asn 

Val 

Ser 

Gin 

190 

Gly Ala 

Gin 

Glu 

lie 

195 

Phe 

Lys 

Leu 

Leu 

Pro 

200 

His 

Thr 

Phe 

Val 

Asp 

205 

Ala 

Glu 

Lys 

Leu 

Pro 

210 

Glu 

lie 

Phe 

Gin 

Ala 

215 

Arg 

Asn 

Leu 

Ser 

Lys 

220 

Gin 

Ser 

Gin 

Ser 

Thr 

225 

Lys 

Arg 

Val 

Phe 

Gin 

230 

Leu 

Tyr 

Gly 

Cys 

Val 

235 

Val 

Thr 

Ser 

Arg 

Asp 

240 

He 

Val 

Ser 

His 

Lys 

245 

Asp 

Pro 

Thr 

Arg 

Gin 

250 

Phe 

Asp 

Lys 

Gly Asp 
255 

Tyr 

Tyr 

Ala 

His 

Pro 

260 

Glu 

His 

Tyr 

Thr 

Pro 

265 

Val 

Phe 

His 

Glu 

Arg 

270 

lie 

Ala 

Pro 

Tyr 

Ala 

275 

Ser 

Val 

lie 

Val 

Asn 

280 

Cys 

Met 

Tyr 

Trp 

Glu 

285 

Lys 

Arg 

Phe 

Pro 

Pro 

290 

Leu 

Leu 

Asn 

Met 

Asp 

295 

Gin 

Leu 

Gin 

Gin 

Leu 

300 

Met 

Glu 

Thr 

Gly 

Cys 

305 

Pro 

Leu 

Val 

Gly 

Val 

310 

Cys 

Asp 

lie 

Thr 

Cys 

315 

Asp 

lie 

Gly 

Gly 

Ser 

320 

lie 

Glu 

Phe 

lie 

Asn 

325 

Lys 

Ser 

Thr 

Ser 

lie 

330 

Glu 

Arg 

Pro 

Phe 

Phe 

335 

Arg 

Tyr 

Asp 

Pro 

Ser 

340 

Lys 

Asn 

Ser 

Tyr 

His 

345 

Asp 

Asp 

Met 

Glu 

Gly Ala 
350 

Gly 

Val 

Val 

Cys 

355 

Leu 

Ala 

Val 

Asp 

lie 

360 

Leu 

Pro 

Thr 

Glu 

Phe 

365 

Ser 

Lys 

Glu 

Ala 

Ser 

370 

Gin 

His 

Phe 

Gly Asn 
375 

lie 

Leu 

Ser 

Arg 

Leu 

380 

Val 

Ala 

Ser 

Leu 
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Ala Ser Val Lys Gin Pro Ala Glu Leu Pro Ser Tyr Leu Arg Arg Ala 
385 390 395 400 


Cys lie Ala His Ala Gly Arg Leu Thr Pro Leu Tyr Glu Tyr lie Pro 
405 410 415 

Arg Met Arg Asn Thr Met lie Asp Leu Ala Pro Ala Lys Thr Asn Pro 
420 425 430 

Leu Pro Asp Lys Lys Tyr Ser Thr Leu Val Ser Leu Ser Gly His Leu 
435 440 445 

Phe Asp Lys Phe Leu lie Asn Glu Ala Leu Asp lie lie Glu Thr Ala 
450 455 460 

Gly Gly Ser Phe His Leu Val Arg Cys Glu Val Gly Gin Ser Thr Asp 

465 470 475 480 

Asp Met Ser Tyr Ser Glu Leu Glu Val Gly Ala Asp Asp Thr Ala Thr 

485 490 495 

Leu Asp Lys lie lie Asp Ser Leu Thr Ser Leu Ala Asn Glu His Gly 
500 , 505 510 

Gly Asp His Asp Ala Gly Gin Glu lie Glu Leu Ala Leu Lys lie Gly 
515 520 525 

Lys Val Asn Glu Tyr Glu Thr Asp Val Thr lie Asp Lys Gly Gly Pro 
530 535 540 

Lys lie Leu lie Leu Gly Ala Gly Arg Val Cys Arg Pro Ala Ala Glu 
545 550 555 560 

Phe Leu Ala Ser Tyr Pro Asp lie Cys Thr Tyr Gly Val Asp Asp His 
565 570 575 

Asp Ala Asp Gin lie His Val lie Val Ala Ser Leu Tyr Gin Lys Asp 
580 585 590 

Ala Glu Glu Thr Val Asp Gly lie Glu Asn Thr Thr Ala Thr Gin Leu 
595 600 605 

Asp Val Ala Asp lie Gly Ser Leu Ser Asp Leu Val Ser Gin Val Glu 
610 615 620 

Val Val lie Ser Leu Leu Pro Ala Ser Phe His Ala Ala lie Ala Gly 

625 630 635 640 

(2) INFORMATION FOR SEQ ID NO:125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 720 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Oryza sativa 


195 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..720 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 215 

(D) OTHER INFORMATION: /label= unknown 
(ix) FEATURE: 



(A) 

NAME/KEY: 

misc feature 




(B) 

LOCATION: 

678 





(D) 

OTHER INFORMATION: /label= unknown 



(xi) SEQUENCE DESCRIPTION: SEQ 

ID NO:125: 



GTTTAAACAT 

CTTTCCAATC 

TTGTTTCTCA 

GGTTGAAGTA 

GTAGTTAGCT 

TGCTGCCTGC 

60 

CAGTTTTCAT 

GCTGCCATAG 

CAAGAGTATG 

CATAGAGATG 

AAGAAGCACT 

TGGTCACTGC 

120 

AAGCTATGTT 

GATGAGTCCA 

TGTCAAAGTT 

GGAACAATCT 

GCAGAAGGTG 

CTGGTGTAAC 

180 

TATTCTCTGT 

GAAATGGGCC 

TGGATCCTGG 

CATANATCAT 

ATGATGTCAA 

TGAAGATGAT 

240 

TGACGAAGCA 

CATTCACGGA 

AGGGGAAAAT 

AAAGTCATTT 

ACATCCTTTT 

GTGGAGGACT 

300 

TCCATCTCCA 

GCTTCTGCAA 

ACAATCCACT 

TGCTTATAAG 

TTCAGTTGGA 

GTCCAGCTGG 

360 

TGCCATCCGT 

GCAGGGAGAA 

ACCCTGCTGT 

CTACAAATTT 

CAT G GAGAAA 

TCATCCATGT 

420 

AGATGGTGAT 

AAATTGTATG 

AATCCGCAAA 

GAGGCTCAGA 

TTACMAGAAC 

TTCCAGCTTT 

480 

TGCACTGGAA 

CACTTGCCAA 

ACCGGAATTC 

CTTGATGTAT 

GGAGACCTGT 

ATGGGATCTC 

540 

CAAAGAAGCA 

TCTACTGTGT 

ACAGGGCTAC 

TCTTCGTTAT 

GAAGGATTTA 

ATGAGATAAT 

600 

GGCAACCTTC 

GCGAAAATTG 

GGTTTTTTGA 

TGCTGCAAGT 

CATCCACTGT 

TGCAACAAAC 

660 

TACTCGCCCT 

ACATACANGG 

ATTTCCTGTT 

GAACCCTCAA 

TGCTTGTACA 

TCTCCAAAAC 

720 


(2) INFORMATION FOR SEQ ID NO:126: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 239 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein. 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Oryza sativa 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:126: 

Phe Lys His Leu Ser Asn Leu Val Ser Gin Val Glu Val Val Val Ser 

1 5 10 15 

Leu Leu Pro Ala Ser Phe His Ala Ala lie Ala Arg Val Cys lie Glu 
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20 


25 


30 


Met Lys Lys His Leu Val Thr Ala Ser Tyr Val Asp Glu Ser Met Ser 
35 40 45 

Lys Leu Glu Gin Ser Ala Glu Gly Ala Gly Val Thr lie Leu Cys Glu 
50 55 60 

Met Gly Leu Asp Pro Gly lie Xaa His Met Met Ser Met Lys Met lie 
65 70 75 80 

Asp Glu Ala His Ser Arg Lys Gly Lys lie Lys Ser Phe Thr Ser Phe 
85 90 95 

Cys Gly Gly Leu Pro Ser Pro Ala Ser Ala Asn Asn Pro Leu Ala Tyr 
100 105 110 

Lys Phe Ser Trp Ser Pro Ala Gly Ala lie Arg Ala Gly Arg Asn Pro 
115 120 125 

Ala Val Tyr Lys Phe His Gly Glu lie lie His Val Asp Gly Asp Lys 
130 135 140 

Leu Tyr Glu Ser Ala Lys Arg Leu Arg Leu Xaa Glu Leu Pro Ala Phe 

145 150 155 160 

Ala Leu Glu His Leu Pro Asn Arg Asn Ser Leu Met Tyr Gly Asp Leu 

165 170 175 

Tyr Gly lie Ser Lys Glu Ala Ser Thr Val Tyr Arg Ala Thr Leu Arg 
180 185 ' 190 

Tyr Glu Gly Phe Asn Glu lie Met Ala Thr Phe Ala Lys lie Gly Phe 
195 200 205 

Phe Asp Ala Ala Ser His Pro Leu Leu Gin Gin Thr Thr Arg Pro Thr 
210 215 220 

Tyr Xaa Asp Phe Leu Leu Asn Pro Gin Cys Leu Tyr lie Ser Lys 

225 230 235 

(2) INFORMATION FOR SEQ ID NO:127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 308 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Oryza sativa 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..129 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:127: 
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CTGCTGTTGC TCCAGAACAA GATCCAAAAG AAAGGAGTGA TCAGGCCTCT GGAACCTGAA 60 

ATTTACATTC CAGCGTTGGA GATCTTGGAG TCATCGGGTA TCAAGCTGGC GGAGAGAGTG 120 

GAGACCTGAG AATCGGACCC AATATGTATA ATGTAGCATG GTGGTAGCTT CTCTATATAT 180 

ATGCTTCAGT GAATAATTGA TTTGCCGTTG TGTGGTAATT AAGCAATGCC CGCTAATAAA 240 

TTGTACCGTA GAAGTCCTTC TATGTACATC CGTATCAAAA AATAAAAAAA GCATCGATTA 300 

GCTTGAAT 308 

(2) INFORMATION FOR SEQ ID NO:128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Oryza sativa 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:128: 

Leu Leu Leu Leu Gin Asn Lys lie Gin Lys Lys Gly Val lie Arg Pro 
iS 10 15 

Leu Glu Pro Glu He Tyr lie Pro Ala Leu Glu lie Leu Glu Ser Ser 
20 25 30 

Gly lie Lys Leu Ala Glu Arg Val Glu Thr 
35 40 

(2) INFORMATION FOR SEQ ID NO:129: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 429 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Triticum aestivum 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..252 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 172 
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(D) OTHER INFORMATION: /label= unknown 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 186 

(D) OTHER INFORMATION: /label= unknown 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 331 

(D) OTHER INFORMATION: /label= unknown 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:129: 


TACCCCGACG 

GGGACCCCAC 

CGAGAAGCAC 

CAAGCGACGC 

TGCTGGAGTT 

CGGAAAGACC 

60 

GAGAACGGCA 

GGCCCACCAC 

CGCCATGGCC 

CTCACCGTTG 

GGGTACCGGC 

AGCGATAGGA 

120 

GCCCTGCTCT 

TGCTCCAGAA 

CAAGGTCCAG 

AGGAAAGGGG 

TGATCCGGCC 

TNTGGAACCG 

180 

GAGAT N TACA 

TCCCTGCGCT 

GGAGATCTTG 

GAAGCGTCGG 

GCATCAAGCT 

GAT C GAGAGA 

240 

GTGGAGACCT 

GAGGATGTCA 

GGATGGGATG 

AGAATCTATC 

GAGTATATAT 

GCTGCAGCAA 

300 

CAGAGGCAGT 

GAGTAAATAA 

AATGATGATT 

NTCGCCGTTG 

TAAGTAAAAT 

GAGTGGACTG 

360 

TATGTATGTA 

TGTGACTATC 

TATTGTACTA 

CATATATACC 

AAATCTGTCG 

CCGGTTGATT 

420 

CTGTTGGTG 






429 

(2) 

INFORMATION 

FOR SEQ ID 

NO:130: 





(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 




(ii) MOLECULE TYPE: 

protein 





(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE:. 

(A) ORGANISM: Triticum aestivum 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:130: 


Tyr 

1 

Pro 

Asp 

Gly Asp 
5 

Pro 

Thr 

Glu 

Lys 

His 

10 

Gin 

Ala 

Thr 

Leu 

Leu 

15 

Glu 

Phe 

Gly 

Lys 

Thr 

20 

Glu 

Asn 

Gly 

Arg 

Pro 

25 

Thr 

Thr 

Ala 

Met 

Ala 

30 

Leu 

Thr 

Val 

Gly 

Val 

35 

Pro 

Ala 

Ala 

lie 

Gly 

40 

Ala 

Leu 

Leu 

Leu 

Leu 

45 

Gin 

Asn 

Lys 

Val 

Gin 

50 

Arg 

Lys 

Gly 

Val 

lie 

55 

Arg 

Pro 

Xaa 

Glu 

Pro 

60 

Glu 

Xaa 

Tyr 

lie 

Pro 

65 

Ala 

Leu 

Glu 

lie 

Leu 

70 

Glu 

Ala 

Ser 

Gly 

He 

75 

Lys 

Leu 

lie 

Glu 

Arg 

80 
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Val Glu Thr 


(2) INFORMATION FOR SEQ ID NO:131: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1449 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 




(ii) MOLECULE TYPE: 

DNA (genomic) 




(xi) SEQUENCE DESCRIPTION: SEQ 

ID NO:131: 



ATGACGAAAA 

AATCAGGTGT 

TTTGATTCTT 

GGTGCTGGAC 

GTGTGTGTCG 

CCCAGCTGCT 

60 

GATTTCCTAG 

CTTCAGTTAG 

AACCATTTCG 

TCACAGCAAT 

GGTACAAAAC 

ATATTTCGGA 

120 

GCAGACTCTG 

AAGAGAAAAC 

AGATGTTCAT 

GTGATTGTCG 

CGTCTCTGTA 

TCTTAAGGAT 

180 

GCCAAAGAGA 

CGGTTGAAGG 

TATTTCAGAT 

GTAGAAGCAG 

TTCGGCTAGA 

TGTATCTGAT 

240 

AGTGAAAGTC 

TCCTTAAGTA 

TGTTTCTCAG 

GTTGATGTTG 

TCCTAAGTTT 

ATTACCTGCA 

300 

AGTTGTCATG 

CTGTTGTAGC 

AAAGACATGC 

ATTGAGCTGA 

AGAAG CAT C T 

CGTCACTGCT 

360 

AGCTATGTTG 

ATGATGAAAC 

GTCCATGTTA 

CATGAGAAGG 

CTAAGAGTGC 

TGGGATAACG 

420 

ATTCTAGGCG 

AAATGGGACT 

GGACCCTGGA 

ATCGATCACA 

TGATGGCGAT 

GAAAATGATC 

480 

AACGATGCTC 

ATAT CAAAAA 

AGGGAAAGTG 

AAGTCTTTTA 

CCTCTTATTG 

TGGAGGGCTT 

540 

CCCTCTCCTG 

CTGCAGCAAA 

TAATCCATTA 

GCATATAAAT 

TTAGCTGGAA 

CCCTGCTGGA 

600 

GCAATTCGAG 

CTGGTCAAAA 

CCCCGCCAAA 

TACAAAAGCA 

ACGGCGACAT 

AATACATGTT 

660 

GATGGGAAGA 

ATCTCTATGA 

TTCCGCGGCA 

AGATTCCGAG 

TACCTAATCT 

TCCAGCTTTT 

720 

GCATTGGAGT 

GTTTTCCAAA 

TCGTGACTCC 

TTGGTTTACG 

GGGAACATTA 

TGGCATCGAG 

780 

AGCGAAGCAA 

CAACGATATT 

TCGTGGAACA 

C T CAGATAT G 

AAGGGTTTAG 

TATGATAATG 

840 

GCAACACTTT 

CGAAACTTGG 

ATTCTTTGAC 

AGTGAAGCAA 

ATCAAGTACT 

CTCCACTGGA 

900 

AAGAGGATTA 

CGTTTGGTGC 

TCTTTTAAGT 

AACATTCTAA 

ATAAGGATGC 

AGACAATGAA 

960 

TCAGAGCCCC 

TAGCGGGAGA 

AGAAGAGATA 

AGCAAGAGAA 

TTATCAAGCT 

TGGACATTCC 

1020 

AAGGAGACTG 

CAGCCAAAGC 

TGCCAAAACA 

ATTGTATTCT 

TGGGGTTCAA 

CGAAGAGAG G 

1080 

GAGGTTCCAT 

CACTGTGTAA 

AAGCGTATTT 

GATGCAACTT 

GTTACCTAAT 

GGAAGAGAAA 

1140 

CTAGCTTATT 

CCGGAAATGA 

ACAGGACATG 

GTGCTTTTGC 

ATCACGAAGT 

AGAAGTGGAA 

1200 

TTCCTTGAAA 

GCAAACGTAT 

AGAGAAGCAC 

ACTGCGACTC 

TTTTGGAATT 

CGGGGACATC 

1260 

AAGAATGGAC 

AAACAACAAC 

CGCTATGGCC 

AAGACTGTTG 

GGATCCCTGC 

AGCCATTGGA 

1320 
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1380 


GCTCTGGTGT TAATTGAAGA CAAGATCAAG ACAAGAGGAG TCTTAAGGCC TCTCGAAGCA 
GAGGTGTATT TGCCAGCTTT GGATATATTG CAAGCATATG GTATAAAGCT GATGGAGAAG 
GCAGAATGA 


(2) INFORMATION FOR SEQ ID NO:132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 482 amino acids 

(B) TYPE: amino acid 

C C) STRANDEDNESS: unknown 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:132: 

Met Thr Lys Lys Ser Gly Val Leu lie Leu Gly Ala Gly Arg Val Cys 
15 10 15 

Arg Pro Ala Ala Asp Phe Leu Ala Ser Val Arg Thr lie Ser Ser Gin 
20 25 30 

Gin Trp Tyr Lys Thr Tyr Phe Gly Ala Asp Ser Glu Glu Lys Thr Asp 
35 40 45 

Val His Val lie Val Ala Ser Leu Tyr Leu Lys Asp Ala Lys Glu Thr 
50 55 60 

Val Glu Gly lie Ser Asp Val Glu Ala Val Arg Leu Asp Val Ser Asp 
65 70 75 80 

Ser Glu Ser Leu Leu Lys Tyr Val Ser Gin Val Asp Val Val Leu Ser 
85 90 95 

Leu Leu Pro Ala Ser Cys His Ala Val Val Ala Lys Thr Cys lie Glu 
100 105 HO 

Leu Lys Lys His Leu Val Thr Ala Ser Tyr Val Asp Asp Glu Thr Ser 
115 120 125 

Met Leu His Glu Lys Ala Lys Ser Ala Gly He Thr lie Leu Gly Glu 
130 135 140 

Met Gly Leu Asp Pro Gly lie Asp His Met Met Ala Met Lys Met lie 

145 150 155 160 

Asn Asp Ala His lie Lys Lys Gly Lys Val Lys Ser Phe Thr Ser Tyr 

165 170 175 

Cys Gly Gly Leu Pro Ser Pro Ala Ala Ala Asn Asn Pro Leu Ala Tyr 
180 185 190 


Lys Phe Ser Trp Asn Pro Ala Gly Ala lie Arg Ala Gly Gin Asn Pro 
195 200 205 


1440 

1449 
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Ala Lys Tyr Lys Ser Asn Gly Asp 
210 215 

Leu Tyr Asp Ser Ala Ala Arg Phe 
225 230 

Ala Leu Glu Cys Phe Pro Asn Arg 
245 

Tyr Gly lie Glu Ser Glu Ala Thr 
260 

Tyr Glu Gly Phe Ser Met lie Met 
275 280 

Phe Asp Ser Glu Ala Asn Gin Val 
290 295 

Phe Gly Ala Leu Leu Ser Asn lie 
305 310 

Ser Glu Pro Leu Ala Gly Glu Glu 
325 

Leu Gly His Ser Lys Glu Thr Ala 
340 

Phe Leu Gly Phe Asn Glu Glu Arg 
355 360 

Val Phe Asp Ala Thr Cys Tyr Leu 
370 375 

Gly Asn Glu Gin Asp Met Val Leu 
385 390 

Phe Leu Glu Ser Lys Arg lie Glu 
405 

Phe Gly Asp lie Lys Asn Gly Gin 
420 

Val Gly lie Pro Ala Ala lie Gly 
435 440 

lie Lys Thr Arg Gly Val Leu Arg 
450 455 

Pro Ala Leu Asp lie Leu Gin Ala 
465 470 

Ala Glu 



lie lie His Val Asp Gly Lys Asn 
220 

Arg Val Pro Asn Leu Pro Ala Phe 
235 240 

Asp Ser Leu Val Tyr Gly Glu His 
250 255 

Thr lie Phe Arg Gly Thr Leu Arg 
265 270 

Ala Thr Leu Ser Lys Leu Gly Phe 
285 

Leu Ser Thr Gly Lys Arg lie Thr 
300 

Leu Asn Lys Asp Ala Asp Asn Glu 
315 320 

Glu lie Ser Lys Arg lie lie Lys 
330 335 

Ala Lys Ala Ala Lys Thr lie Val 
345 350 

Glu Val Pro Ser Leu Cys Lys Ser 
365 

Met Glu Glu Lys Leu Ala Tyr Ser 
380 

Leu His His Glu Val Glu Val Glu 
395 400 

Lys His Thr Ala Thr Leu Leu Glu 
410 415 

Thr Thr Thr Ala Met Ala Lys Thr 
425 430 

Ala Leu Val Leu lie Glu Asp Lys 
445 

Pro Leu Glu Ala Glu Val Tyr Leu 
460 

Tyr Gly lie Lys Leu Met Glu Lys 
475 480 
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